(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TCTTGAAGAG TGTACAGTAC AGGATTATTA TAATGAAAGT TTATATCAAC AGGGTTTCGT 60
TGGCTCTGCA TATATTATAA GCAAAAGAGA TTGGTAAAGT GCCACAGTAT TCCAGATAAC 120
TTTTCAGTTG CGGCCTTTCT TCTCGTTCTT TAATTTGAAA CCTAGATACA TGCAGTAAAA 180 
ACTAGGAGAA TGACTTTTAC CCTTGGGGAC AGCCAAGTTT TGTTGATAAA CCTATTTCCT 240 


AGCATGCCTT CAGGAAGTTG TGCCAGACCC TAGATTGTGA AGGACCCACT GTTCTTCTGT 300 
TGTACGAGCT CCCTGAACCA TTGTTCAGAG GACCAATGTC ACATCGCTTC ATGGGCATGG 360 
NCCATGGGAG CATCTGGGTG ATAYCTGTCT ACAGTATTGG CTCTTCTGCG AGGCTGATAC 420
ACAAGGCCTC TCTTCCACAT GATCATTTGC AAACCTCCCC CAGCCCCTAC CATCCAATGT 480 
GGAAGGAAAA CAAGAACTGC CTGAAGAAGA GTCCAAGCTA CAGATACACA GCGTGTGCAT 540 


TGCGGCTGTC ACCTTCCTCC TCCCACTTCT GTATCCTCAG AGATGCTGCG TGGATGTTTC 600 
CTTAACCTCA GCTGACTTCC CTGTGAATGT CTAATGCTAG TTCAGGGCCT CCAGGCATTG 660 
ATTTGTACAG TGGTAACTCC CAATGAGGCT TCTGTTATCA TTTGGTGTGC 1TTYTCTGTC 720
ATTAAAAGAA ATGATTTTCC C 741

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 991 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

GGCACGAGTC TCCCCTGGGG AAGTTTTTCT TTTTCAGGAG GGAGGAQGGC TTTCCCAGGT 60 
AATGTGTCTA GAGTGTTGGG CAGAAAATCT GGGACCACAC CACACCAGTT CTCTCCTTAA 120 


TCCACGTCAT TTGCCTTCTA TCCCAGCTAT GTTTCCAGTG TCCTCTGGGT GTTTCCAAGA 180 
GCAACAAGAA ATGAATAAAT CTCTGGTGAG TTGTTTATTT GTTCTTCACT TTGTTTTACA 240 
CTCTATTTTC TGAGTTTATG GGTGTCTGTG AATTAAAAAG GAAAAGTAGA AATAAGTAAA 300

ACTCAGGTTG AAGGAAATAT ACATAAATAA GATAAAGCTG ACCTGTAGAT ATAGCAGGTT 360

ATAAAGCTTA GAGTTGTCTA AGTTGAGTGC AAATTTTCCT CTGATCTTTC TGATCCCXSAA 420

CAAAAAAGCA GTCATGTTTG TTATCTGATT GGAATGGAAC CCGAGAAGAG AGCATGCOCT 480 

GTOCTTGTCG GACAGGAAAG CTTGCGTGCA CCAAGTCTGA ACCACCACCT OCATCGrcAC 540 

ATAGATTATG TGCTGGAACA TATTTCACAC CGGCCTGGCA GTAAACACTT CTAGTOTICT 600

GCAGTGGAAA CGGTCATCTT CCX^CTAAAGC ACGGCGTGTT GTGCAGCGGA AATCGTCATC 660 

TGCTGCTAAA ACACAGCTTC CATCGTAATG TATGCTCCTT ACTCAAAGAG TCTCGTCCCA 720 

AACAGCCTTT GGGAGGTCCT CCTTGATTCA TGGATGAAAC COXSGAACATC TIGAGGACTC 780 

AGTTAACCAT AGGTCCTTAA ATAACTCTCC ACACGTTTTT CTTAGTTTAT CTCTACATCC 840 

AGGGTGTGCA GCAGCCTGTT CAAAGTCATA TTTTCTGGGA AATATTTCCA GTCTTTATTT 900 

GCACTTTAGC CCACTCTGTG TAGCCTTATT TCTTCTAAAC TCACCATTAA TCTGAATAAT 960 

AGTCAAATTT AGGGGGACTG TATTTGCCTT A 991






(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

CCACGCGTCC GGAATTCCCC TGAGGATCTT QGGCTATCTT TGACAGGGGA TTCTTCCAAG 60 

TTGATGCTTT CTACAAGTGA ATATAGTCAG TCCCCAAAGA TQGAGAGCIT GAGTTCTCAC 120 

AGAATTGATG AAGATGGAGA AAACACACAG ATTGAGGATA CGGAACCCAT GTCTCCAGTT 180 

CTCAATTCTA AATTTGTTCC TGCTGAAAAT GATAGTATCC TGATCAATCC AGCACAGGAT 240

GGTGAAGTAC AACTGAGTCA GAATGATGAC AAAACAAAGG GAGATGATAC AGACACCAGG 300

GATGACATTA GTATTTTAGC CACTGGTTGC AAGGGCAGAG AAGAAACGGT AGCAGAAGAA 360 

GTTTGTATTG ATCTCACTTG TGATTCGGGG AGTCAGGCAG TTCCGTCACC AGCTACTCGA 420 

TCTGAGGCAC TTTCTAGTGT GTTAGATCAG GAGGAAGCTA TGGAAATTAA AGAACACCAT 480 

CCAGAGGAGG GGTCTTCAGG GTCTGAGGTG GAAGAAATCC CTGAGACACC TTCTXSAAAGT 540

CAAGGAGAGG AACTCAAAGA AGAAAATATG GAGAGTGTTC CGTTCCACCT TTCTCTCACT 600 

GAAACTCAGT CCCAAGGGTT GTGTCTTCGG AGGCATCCAA AAAAAAAAAA AAA 653 

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGCAGGCTGA CGACCTGCAA GCCACAGTGG CTGCCCTGTG CGTGCTGCGA GGTGGGGGAC 60 


CCTGGGCAGG AAGCTGGCTG AGCCCCAAGA CCCCGGGGGC CATGGGCGGG GATCTGGTGC 120 

TTGGCCTGGG GGCCTTGAGA CGCCGAAAGC GCTTGCTGGA GCAGGAGAAG TCTCTRGCCG 180 

GCTGGGCACT GGTGCTGGCA SGARCTGGCA TTGGACTCAT GGTGCTGCAT GCAGAGATGC 240

TGTGGTTCGG GGGGTGCTCG GCTGTCAATG CCACTGGGCA CCTTTCAGAC ACACTTTGQC 300 

TGATCCCCAT CACATTCCTG ACCATCGGCT ATGGTGACGT GGTGCCGGGC ACCATGTGGG 360 


GCAAGATCGT YTGCCTGTGC ACTGGAGTCA TGGGTGTCTG CTGCACAGCC CTGCTGGTQG 420 

CCGTGGTGGC CCGGAAGCTG GAGTTTAACA AGGCAGAGAA GCACGTGCAC AACTTCATGA 480 

TGGATATCCA GTATACCAAA GAGATGAAGG AGTCCGCTGC CCGAGTGCTA CAAGAAGCCT 540

GGATGTTCTA CAAACATACT CGCAGGAAGG AGTCTCATGC TGCCCGCANG CATCAGCGCA 600 

ANCTGCTGGC CGCCATCAAC GCGTTCCGCC AGGTGCGGCT GAAACACCGG AAGCTCCGGG 660 


AACAAGTGAA CTCCATGGTG GACATCTCCA AGATGCACAT GATCCTGTAT GACCTGCAGC 720 

AGAATCTGAG CAGCTCACAC CGGGCCCTGG AGAAACAGAT TGACACGCTG GCGGGGAAGC 780 

TGGATGCCCT GACTGAGCTG CTTAGCACTG CCCTGGGGCC GAGGCAGCTT CCAGAACCCA 840

GCCAGCAGTC CAAGTAGCTG GACCCACGAG GAGGAACCAG GCTACTTTCC CCAGTACTGA 900 

GGTGGTGGAC ATCGTCTCTG CCACTCCTGA CCCAGCCCTG AACAAAGCAC CTCAAGTGCA 960 


AQGACCAAAG GGGGCCCTGG CTTGGAGTGG GTTGGCTTGC TGATGGCTGC TGGAGGGGAC 1020 

GCTGGCTAAA GTGGGKAGGC CTTGGCCCAC CTGAGGCCCC AGGTGGGAAC ATGGTCACCC 1080 

CCACTCTGCA TACCCTCATC AAAAACACTC TCACTATGCT GCTATGGACG ACCTCCAGCT 1140

CTCAGTTACA AGTGCAGGCG ACTGGAGGCA GGACTCCTGG GTCCCTGGGA AAGAGGGTAC 1200 

TAGGGGCCCG GATCCAGGAT TCTGGGAGGC TTCAGTTACC GCTGGCCGAG CTCAAGAACT 1260 


GGGTATGAGG CTGGGGCGGG GCTGGAGGTG GCGCCCCCTG GTGGGACAAC AAAGAGGACA 1320 

CCATTTTTCC AGAGCTGCAG AGAGCACCTG GTGGGGAGGA AGAAGTGTAA CTCACCAGCC 1380 

TCTGCTCTTA TCTTTGTAAT AAATGTTAAA GCCAGAAAAA AATAAAAAAA AAAAAAAAAA 1440

AACTCGAGGG GGGCCCRKAC CCAATCWCCC TATAGTAKAC GTANNN 1486

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2323 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CTTCGCCGTT TCTCCTGCCA GGGGAGGTCC CGGCTTCCCG TGGAGGCTCC GGACCAAGCC 60 

CCTTCAGCTT CTCCCTCCGG ATCGATGTGC TGCCGCCGCC GCCGCCGCCG TCCCGCGTCC 120 


TTCGGTCTCT GCTCCCGGGA CCCGGCTCCG CGCAGCCAGC CAGCATGTCG GGGATCAAGA 180 

AGCAAAAGAC GGAGAACCAG CAGAAATCCA CCAATGTAGT CTATCAGGCC CACCATGTGA 240 

GCAGGAATAA GAGAGGGCAA GTGGTTGGAA CAAGGGGTGG GTTCCGAGGA TGTACCGTGT 300

GGCTAACAGG TCTCTCTGGT GCTGGGAAAA ACAACGATAA GTTTTGCCCT GGAGGAGTAC 360 

TTGTCTCCCA TGCCATCCCT GTTAATTCCT GGATGGGGAC AATGTCCGTC ATGGCCTTAA 420 


CAGAATCCCC CAGATGGCTT CATGGCCCCC AAAGCATGGA AGGTCCTGAC AGATTATTAC 480 

AGGTCCCTGC AGAAGAACTA AGCCTTTGGT CCAGAGTTTC TTTCTGAAGT GCTCTTTGAT 540 

TACCTTTTCT ATTTTTATGA TTAGATGCTT TGTATTAAAT TGCTTCTCAA TGATGCATTT 600

TAATCTTTTA TAATGAAGTA AAAC3TTGTGT CTATAATTAA AAAAATATAT ATATATATAC 660 

ACACACACAT ATACATACAA AGTCAAACTG AAGACCAAAT CTTAGCAGGT AAAAGCAATA 720 


TTCTTATACA TTTCATAATA AAATTAGCTC TATGTATTTT CTACTGCACC TGAGCAGGCA 780 

GGTCCCAGAT TTCTTAAGGC TTTGTTTGAC CATGTGTCTA GTTACTTGCT GAAAAGTGAA 840 

TATATTTTCC AGCATGTCTT GACAACCTGT ACTCTTCCAA TGTCATTTAT CAGTTGTAAA 900

ATATATCAGA TGTGTCCTCT TCTGTACAAT TGACAAAAAA AAAAATTTTT TTTTCTCACT 960 

CTAAAAGAGG TGTGGCTCAC ATCAAGATTC TTCCTGATAT TTTACCTCAT GCTGTACAAA 1020 


GCCTTAATGT TGTAATCATA TCTTACGTGT TGAAGACCTG ACTQGAGAAA CAAAATGTGC 1080 

AATAACGTGA ATTTTATCTT AGAGATCTGT GCAGCCTATT TCTGTCACAA AAGTTATATT 1140 

GTCTAATAAG AGAAGTCTTA ATGGCCTCTG TGAATAATGT AACTCCAGTT ACACGGTGAC 1200

TTTTAATAGC ATACAGTGAT TTGATGAAAG GACGTCAAAC AATGTGGCGA TGTCGTGGAA 1260 

AGTrATCTTT CCCGCTCTTT GCTGTGGTCA TTGTGTCTTG CAGAAAGGAT GGCCCTGATG 1320 




CAGCAGCAGC GCCAGCTGTA ATAAAAAATA ATTCACACTA TCAGACTAGC AAGGCACTAG 1380 

AACTGGAAAA GACCACAGAA AACAAAGAAT CCAACCCTTT CATCTTACAG GTGAACAAAC 1440 

TGTGATGATG CACATGTATG TGTTTTGTAA GCTGTGAGCA CCGTAACAAA ATGTAAATTT 1500 

GCCATTATTA GGAAGTGCTG GTGGCAGTGA AGAAGCACCC AGGCCACTTG ACTCCCAGTC 1560 

TGGTGCCCTG TCTACACCAG ACAACACAGG AGCTGGGTCA GATTCCCCTC AGCTCCrTAA 1620 

CAAAGTTCCT CGAACAGAAA GTGCTTACAA AGCTGCCTTC TCGGATACTG AAAGGTCGAG 1680 

TTTTCTGAAC TGCACTGATT TTATTGCAGT TGAAAAAAAA AAAAAGCTAT TCCAAAGATT 1740 

TCAAGCTGTT CTGAGACATC TTCTGATGGC TTTACTTCCT GAGAGGCAAT GTTTTTACTT 1800 

TATGCATAAT TCATTGTTGC CAAGGAATAA AGTGAAGAAA CAGCACCTTT TAATATATAG 1860 

GTCTCTCTGG AAGAGACCTA AATTAGAAAG AGAAAACTGT GACAATTTTC ATATTCTCAT 1920 

TCTTAAAAAA CACTAATCTT AACTAACAAA AGTTCTTTTG AGAATAAGTT ACACACAATG 1980

GCCACAGCAG TTTGTCTTTA ATAGTATAGT GCCTATACTC ATGTAATCGG TTACTCACTA 2040 

CTGCCTTTAA AAAAAAAAAC CAGCATATTT ATTGAAAACA TGAGACAGGA TTATAGTGCC 2100 

TTAACCGATA TATTTTGTGA CTTAAAAAAT ACATTTAAAA CTGCTCTTCT GCTCTAGTAC 2160 

CATGCTTAGT GCAAATGATT ATTTCTATGT ACAACTGATG CTTGTTCTTA TTTTAATAAA 2220 

TTTATCAGAG TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 2323 






(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GGCACGAGCC TGTGTGGTCA TGTTCCTCGT GGTGCAGTAC CTGACATGAG CCAGCCACGC 60 

TCAGTGGCTG AACAGCATTC CCACAGCCTG CAAGTGTGTG TGTGTGTGAA AGAGAGAGGG 120 

GGGCCCAGAG CCGCCTTTTG AAATGTTTGC CTGTCTGAAC TGTGAAGACA CTTOGGAGTG 180 

ATTGTGGTCT AATTTCCAAC CTGCTCTGTT TTCTGTGACA TCTTGGAGGG GAGCTAGTGC 240 

CACACCATGC GCGGTGCTTA GAAATGAAAA AGTCCCGGGT CTGTCTCTCT CACTCTCGCT 300 

CTCATGGGGG AGGGAAAGAA TGGCTTTGGT GGCTTTGTTC ACACAGCTGA TGCGTGCTGG 360 

GAAGGTGTCC ACAGTGAGCC TGTGTGCAGG ACTGTCCACA CGGTTCACAC TTGTCACCAT 420 
CAGGCCTTTC TGGTCCTGAT AGGGTGGAGC AAAAGTGGAA AGGAAAGGAA AGAGGCTTTT 480

CTCACAGCCA TTATATTAAA TAGTAGGTCG ATTCACATCT CX3TGCTCCTG GCCACCTTCC 540 


CXTTGTGCXrrC AGTGACATGT AGATGACTGA CTGCXAATAC TTGTCACCAT TCCCTGGAAG 600 

CAGCTACCTA GGGGAAACAA GATGTAGTGC TATTGCCGAT AACAAGTAAG ATTTTCCACA 660 

CTAAAAAAAA AAAAAAAAAA AAA 683



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2036 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CTGAGAAAGG AAAGCATTCG GATCTGCTGC AAAAACACAT ATATCCATAA AGACTCATGT 60

TATTCAGAAA ACAGATTGTG AACACAATCA CATTCGCATG AATCCTTTAA AAGGAAGAAG 120 

ACCTTAAAGT ATCTGCAAAT CTGAATTTCT ATTTATTCCT TCACTGAATA TAGAAACAAT 180 


GGTTATCTGA TTATTAGAGA TATTATTITG GATATGTTAC TTATTAACTT GCTATGGCTG 240 

GTAACCATGA TAAAGTCTGT TATTAATAAC AACATAATTC TTTTTTTAAA GAAGAAAAGC 300 

TTATTTTTCA TTGACAGTGT ATAGATTTAT CTACTTAGTT GTGTTTTGCT ATTAGTGTTT 360

TAATTTTTTT TTTAAGTTGA GTXJTTTGAtA AATTTTAAGA CCCTGTCCCC' ACCiUXJ'iTrr 420 

GAGTCCTGTG TTGACTACAG GTATATAGCY CAWTTTAAAA ATCCTAAAGC AAAAGAATTT 480 


TATTTATAAA AGAATCMAMC MGTTGCATGC ATGAGGCTGT GAAGTCAGAT ATTTAGTAAT 540 

AAAAGCAGCA GTGCCTTTTT TTGTATTTAC CCATTGACCC CCACCAAATG CAACTGTTTT 600 

ATATTAAGAA AATAGTAACA ATTTTAAAAT CTCAGAGTAA AATCTATTTC ACTACATGCT 660

TTTCCCCCCT TGTTCTGATT TAAGCAGTGT GTACTTGGCA TCTCTACATT GTCCTAGGGA 720 

CAGTGGTGTT CTACAATATT ATCATGTATG ATGTTTTATT QGTGCTTTTT ATTCATAGTG 780 


GCTTCTTACC AGAAACAGTA GGAAGAAACA CATGAACTGT GTACAAGACA TGAAACATTG 840 

CTGCTGATAT GTTGTTTTTT CACATGCTTT TGAGTTTTCA CTTTTTAAAC GAGAGCCAGC 900 

AAGCAAAATA GATGTGGCTG GGTCTGCCTG TCCQGGCGGC TYTTTGCACC GAGCTCTCAA 960

ATCCTGTGTA TTGAGGGTTC CTTTTTGGTA CTCAGGATTG GAGCTACAGC TGGGCCCCCC 1020 

TCTCTCCCAT TCGTTTGAAG AGACACTGAG GGAAACAAGG GTTTCnTTG AGGTGTCCTT 1080 




WO 98/39448

PCT/US98/04493

GGCTGCCTTT TACGGGATGG GAGCCTTCTC CGGATCTTTT GTTCTTCTGC ACCTCTTGTA 1140 

GCTACTGCCG GTGCAAGGTT GTAGATGTTA TTCCCCAGGA GCCTGGGCTK GGGGGCTGAG 1200 

CTGGGCTGAA TGCAAAAGCA TGCAACCAGA AGGCGGGCAA GGGGAGGAAA AGCAGGCCTG 1260 

GCCTCATTGG TCCCCTGGAG ATGTCTGTAG CAGTCAGCTC CAGCTTGGGC CTGGGGAAGC 1320 

AGCCTGACCA AGGCGCTCAG GTGTGCCTGT TACAAGAAGA ACCTGCAGAA GGATAATTTG 1380 

CACATGGAGC TGTGATAACA CTAATGTTGA TTTTACAAGT CATCAGRGAT 1440 

GTTTGCAAAG TGAGTTTTAT TTTITTGTAA TTCCTTTATC TTTACTTAAA GGTGAATGTG 1500 

TATTCCTCTG GGAGGAATAG GAAGAAAACA GGAATGTTAA TAATGTCGAA CAGAAAACTT 1560 

CCTCCCTTAT TAATATATAA TCYTCATGTA TTTATGCCOT AATGTAAGCT GACTTTTAAA 1620 

AAGCTTTCTT TTGTTGCATG CCCTGTGCAG GCATCTGTAT TGTACATGCA TGCCTTTCGT 1680 

CCTGTTTTCC TGTATAAAGT TAGTGAACAA AGAAATATTT TTGCCCTAGT TCATGTTGCC 1740 

AAGCAATGCA TATTTTTTAA ATTTGTCATA TATQGAAAGA GGATGTOTGT TACATGTAAA 1800 

AGCTTTACTG ATATACAGAT ATACTAATGT TTGAAGATGC TGTTCTTTGC AAGTGTACAG 1860 

TTTTCAAATG TTGTTACCAG TGAAACACCC TTGTGGTTTA AACTTGCTAC AATGTATTTA 1920 

TTATTCATTT CCTCCCATGT AACTAAGAAT CATGGCTATA TTTCATATCA ACGTTATATT 1980 

GAAAGTGAAG GGAAATGATT AATACAAGGT TTTGTAACAA AAAAAAANAA ANNAAA 2036 



(2) INFORMATION FOR SEQ ID NO: 27:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GGCACGAGAT AACATAGGCA CAATAATACT GTATGTCTAC TTCTAGGATT ATAAGGAATT 60
AACATTGAGA TGACATTTCC ATTTGAGAAG AAAATAGTTG CTTTCAGTGC CTTTTATTTG 120
ATTCCTGGAG AGAGCAGACT CGCACCAACA TTCAACCCCA GCGCTGATAT GACAGTAATC 180
CTCAGAGGCA GAGCCCAGCA CAAAACAGCA ATGCTAGAAA GTTACAATTG GAAAGTTTCC 240
TGCCAGCTTC GGGAATGACA CTGCAAAGCT GATGCCAGAA ACTGCCAGAG TAATTCTCCT 300
CATTACTGCT CTACCCACCC ACTTTCAGCT CCCCAAATTA ACTAGTGCAG TTGACTAATC 360
CTCTTTACCT TTATCATTTA GGTGAGGCAT TGCACAAAAA CTCTCGACTT TGCCATATAA 420
GGGCTGTGGT TCTCTGTGGT CCTGGATAAG AGGCATCACC ATTATCTGGA AACATGCAGT 480
AAATGCAGAT TCTTCATCTT CTCCCCAGAC CTCCTGAGTT AGAAATTCAC AAGTTCTCCA 540
GGTGATCTCA TACATGCTAA AGTTTGAGAA CCATTGAGTA AAGTTAATGC ATTAAGAAGA 600 


GATTAGATAG GGATGGTGGC GTATCTTCCT ACAGTTTCCX: TGTTAACAAG AAAGTCAGAG 660
GTCAGTTGAT CAGACATTAG ATTATTTATT GCTAAAACTA AAAAAAATTA AAAAAAA 717 


(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 495 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GAATTCGGCA CGAGCAGCAT CCTAATTTTA GTTTGGAGAT GCATTCTAAA GGATCTTCTC 60
TATTGCTTTT TCTCCCACAA TTAATCTTGA TTCTGCCTGT CTGTGCACAT TTGCATGAGG 120
AACTGAACTG TTGTTTTCAT AGGTAAATGA GAGACTGAGT TTTTTCATTT CTGAAGAGAA 180 
AGGGCATTTG CTCCTACAAG CTGAAAGGCA CCCCTGGGTG GCTGGGGCCC TCGTGGGAGT 240 


TTCTGGGGGA TTGACCCTTA CAACATGCAG TGGCCCTACA GAAAAACCTG CAACTAAAAA 300 
TTATTTTTTA AAAAGGCTCC TCCAGGAAAT GCATATAAGG GCTAATCACC CAGTATTTTG 360 
ARGCTTCGAA GARGTAATAR AMCCCTGGAG AGAGAAACTG AGACATGTAA GAGGGTGGGA 420
ATGACTCAGT GGTGGCACAC TATGGAGTCC TGCCCACAAG TAGCACACAT CAACCCACTA 480
CACAGAAATC CTAGG 495 

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 556 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

AGCTTAACGT CATGATTCAT TAGGGGAATG CAAGGCAAAA CCATGATGAG AATQCCCCTA 60 

GACACCTCTT AGAAGAGCTG CTAGAAAGGC AGACAGCACC AAGCGCTTAA ATGAGATGGG 120 

GGCACTGGTG CTTCTTCTGT GCCTACTGGT AGGGGTGCAG CAGAGTGGTT CAGTCTGGGA 180 

CAGTTAGCTG GACATCACGT GGACCCAACA CACGCATTTC CTGGGTTACT TACCAAGGAG 240






AATAGAAAGC AGGCAGATCT TTACAGCAGC TCTTACCTGEW TTGCAAAACA ATGGAAATCC 300 

CCACATGTCC ACAAACAAGT KTGTOGTCTG CCTGTGCCAT GAAGCACAOT CTGGCTGAGC 360 


GTCAAGAGTC CCCACACTCA AAGGAGGCAG CAGATACAGG GCTGCACACT GTGTGATTCC 420 

ACACATGTGA CATTCTGGAC ACGGACATGC TGGATGGCAA AACGAGCATC GGGCTGAGAG 480 

GACTGCTGAG AAGGGGAACG GGGCTGCTGG GATGTGGGTT GATTCTAGCA GTAGCTCATG 540

GAGATGTGAC CTCAAA 556 

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 434 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

CTAAATGGTG ACTGTGGCTT TGTCGAGACA GGCCCCAAAT GGTAGGTGTG AACACAACAT 60 

GCACAGAATG AGGAGACATG CAGAGTGCTG AAATACTGTC CTGGACAGAT GTGTTACATG 120

ACTTTCTTTT CAGCTTATTT CTGTGGCCTG CCTTTGAAGA TAGAGCTTTG TTGATATTTA 180 

CATTAAACCA AATTGTATAA YTATGTTCCA TTCTGACATG TTATTTAGCA AARGAAAAAR 240 

GAGTAATTCT ACATCAGCAT CTTTAGTGCA TGCTAAAAGA TTAAAAATGT CTTTTGGGGA 300

ACATGTTTTG TATAGATAAA TGTTTAGATA GAAATATTTA TAGAATNCTG TATGTGAGTA 360

TTNATCTCCC TATGTATATT TATATCTAGA TGTGTCAATC TTTGTATTGA TATGAAATGC 420 


TATGAATAGT GAGA 434 



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 715 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CCACGCGTCC GATCTCACAG CTCCGACACT ATTGCGAGCC ATACACAACC TGGTGTCAGG 60 
AAACGTACTC CCAAACTAAG CCCAAGATGC AAAGTTTGGT TCAATGGGGG TTAGACAGCT 120 
ATGACTATCT CCAAAATGCA CCTCCTGGAT TTTTTCCGAG ACTTGGTGTT ATTGGTTTTG 180






CTGGCCTTAT TGGACTCCTT TTGGCTAGAG GTTCAAAAAT AAAGAAGCTA GTGTATCCGC 240 

CTGGTTTCAT GGGATTAGCT GCCTCCCTCT ATTATCCACA ACAAGCCATC GTGTTTGCCC 300 


AGGTCAGTGG GGAGAGATTA TATGACTGGG GTTTACGAGG ATATATAGTC ATAGAAGATT 360 

TCTGGAAGGA GAACTTTCAA AAGCCAGGAA ATGTGAAQAA TTCACCTGGA ACTAAGTAGA 420 

AAACTCCATG CTCTGCCATC TTAATCAGTT ATAGGTAAAC ATTGGAACTC CATAGAATAA 480

ATCAGTATTT CTACAGAAAA ATGGCATAGA AGTCAGTATT GAATGTATTA AATTGGCTTT 540 

CTTCTTCAGG AAAAACTAGA CXAGACCTCT GTTATCTTCT GTGAAATCAT CCTACAAGCA 600 

AACTAACCTG GAATCCCTTC ACCTAGAGAT AATGTACAAG CCTTAGAACT CCTCATTCTC 660 

ATGTTGCTAT TTATGTACCT AATTAAAACC CAAGTTAAAA AAAAAAAAAA AAAAA 715 



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
GAGCCAGTGC CGGCGAAAGG GGACCTTCCT CTACTTCCTG CCACAGACCC TGTCCCCACA 60 
CACTTCCTGC CCCTGCTCTG CTGGGAGGCC ACTTCCTCCC CCAGTGCTGG ATTCCACCCC 120
CAGCTCACCC TCAAACATGG " CCCCCrCICT"CC^^ GCTCCCTGG^ ~180' 

GGCTGTTCTG TCCTCCCCTC TTGAAAAGCA ATGCCAGCTT CCTGGGATCT TCTGCCAACT 240 


CCAGCTACCA TGCCCTTTGC TCCTGTCAGC TCAGCTCCTC AAGQGAATTG TCTAMCCTCG 300 
GTGTCCTGCT TCCCTCCCTC AACCTCCTCA CCCTGCTCCA AGCTGGCATC TGCCCCTCCA 360 
CTGCACAGAA CGGNTCCCCC ACCACCTGCC TTTACAGGGA GGAAGCAGCA ACATGGAAGA 420
ANCGAACTAT AGGGGCTACA ANGATGCTCA GCTCTGATCC CGAAGGCAAA AAGNATCTTT 480 
GGGCAC 486 



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 725 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

TAATAATTAG GTTATTCCCA GAAGCACAGT GTCATTCTTT AAATAAAAGC 60 

TTTCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA AGATTCCCCC TAGGGTTCAT 120 

ATGTGTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA TTTTAAAC3CT TTGGCTATAT 180 

AGTCAGAAAT GTCCTAAATA ACAAACTATT TTGTATTTAA TTTAGGGAAG ACTAAAGGGA 240

AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT ATTAGGGCTT AAAGGGCTTT 300 

TCTAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT TCCTGTTTTT GATGAACAGA 360 

TCTCTGGGGA AATTGTTGAG TTACAATGGC ATTTCACTGT GATCCCTGTC AAGCTCAGAT 420 

CAGTTCTATA ACCCAATGAC AACCTGTCTC TTTGGTTTAC TGTCCTGTGA AATGTCAGCT 480 

CAAGTTTCCC AGAAGTCGTG TGTTTATGAT GAGTCAGAGT GCTTTTCCTC GGTGGGACAG 540

TTGCTGGCCC TCTTAATrTT GGTGTATGTG CTTCCAAGTA TCTAAACCTC CAGTCTGATC 600 

TGTATATGCT ATCCTAACTG TTAATTGTAT TATTGATTAT GTTGATTATC TTGCTTGAAG 660 

GTTCATACTT TTCAATTTGA TAGAAATAAA GTTTTTTTCT GCTTATAAAA AAAAAAAAAA 720 
AAAAA 725



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 437 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CACACAGCAT GCTGCCCTCA GACGTGTCCA TCCTOTACCA CATGAAAACG CTGCTGCTCC 60 

TGCAAGATAC TGAGAGATTG TAGCATGCTC TGGAAATGTT CCCAGAACAT TGCACGATGC 120
CTCCTGCTTT TATTGGCTCT TGTCGAAATC AAATTGGAAG ATCTTCAGTC CCAGCTGCAC 180 
CCAACGTGGA AAAGTATTCC AGGTCCATCC CCAAGGAACC AACACCGATG ACATOGACTC 240 
AGGAATCTTA TAACCTACGT GGACTCTTTC CATCCGTACA TTGTCGTGCA CATGCCACTC 300 
ATCACCTGGC GTGCCCAGAT CCTCGCARGG CAACACCCTG TGATAATTCC AGGTCATTCT 360 

CTACATCTGC AGCTTGAGGT TAGCCTCATA TCACATTACA TTCTCACTAN AAACNAAAAA 420
AAAAAAAAAA AACTCNA 437 

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 943 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

GGCACGAGCT GGAACAGAGA CTAAATCCCA CGAAACTGAC ATTGTTAAAC ACACTAAAAC 60 

AGAAGTACTT ACCTCTTGAA GATTTAATAT ATAATGGTTG ACATGATACA TGTACATGAT 120 


GAATGACCAG ATGCTTATGG TCTACATTTT OdTTATCCT GTTAGTATTA CCTTCCTTAA 180 

TCTTTGTTCA TTAACATGCT AATTCCTCTT CAGTGTTTAT TTTCTAGTGA CAGAATQCTA 240 

ACATTTCTTA CACCCTGGCA GAAGGGAGAG AAATGTGTTT TGGGGTGGGT AACTAAATTT 300

TTGAGTGAAA TATCATAAGA TGANAATGGA AANAAGGAGA CACAAANAGT TATNACAAAA 360 

AAACAATGGT TTTTTTAGCC ATTTGACTGG CTCTTTAAAT AGTCTACAAG ACATTCACGT 420 


TTAACATCAC TTTTAGTGAA ATAAAATGTG CCATACTAGT ATGTGCTTCA AAAGGGCAAA 480 

TGTGCTTTAG TGCCCTAAGG CTAAATTTTG GTCATTTGAC ATCAGAGATG TTGTAAGTAT 540 

TGCACTTAAT ACGCACCTAT TTNTCAATAG TGTTATTTTT TQGNTAGCAT TTTTTTTACC 600

ACTATNTTGT TGATAGCTTT TTGTTCTNTN AGGTTGNAAN ATGACAGTGC TNATNTCAAA 660 

CAGATTACCC ATNTGCAGAA CTAAGGGAAG CNATTTATGT ATGAAACTiAA TTNTTGAATT 720 


NGTCATTNTC AACCNTTGNA TTAAAGCTTA GACTAAATAG TAATATATNG TGGGNAGGAT 780 

TTTGGTTTTG TGATATTTNT GTCTOATTAAG GNATAGATGT TAACCNTTAT TTTGTAGNAA 840 

AGTGANTTGT ATGTGGTTAA TTATAAATAA AACTGGTACC AGGNAAAAAA AAAAAAAAAN 900

NAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 943 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 604 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

GGCACGAGAA ATCTTCATGC TGTAGTCACT CCAGACCATG GAGTGGCTTT CCAGCTGAAT 60 
GAATCCTATG TCTCGCGTGC AGGTGGTTGG TTTTCAATGT TCTTGCTAAT TTrrTTTCTA 120 

TTGGATCTTG GGAGTTTTCT TTGTTTGCTC CTGTGTTTGC CCAGCTTTAA TAAAACCAGG 180

CGCAAACAAA AACCATAGCA TTCTGAACAA TAGGGGGCCC ACATTGGACC CAGTATGTCA 240 

CTTTAATGGA CTTCAAGAAA AAATCTGAAT GGGAAAAATG ACACTAGGAA TGTATACTCC 300 

ACACATTTTA TGCCATATAA TGGTGTGTTT TCTTAATTTT GTTTCTTGTG GCGAAATGTX3 360 

GCTTTCAAAT TAAAATGACC TTTTCTTCTT TGAAACTTTT TGTTTTGACT TGTATAATTA 420 

AGGGTTTGGA AAGATTCATA ATTCTGAGAG AGGTTTGCAA CCAGGAGATA CAAAGAAGTC 480 

TCAGTAGTAA TCTTGTTCAT GTGCTTITAC AGCCAGCTAC ATTTAAGGAT GTATTAGTTA 540 

CAGAAATTAT ATGTCTGTGT ATGTGTCTCT ACTCAATAAA GTACATGCCT CCACAAAAAA 600 

AAAA 604 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



GTGAGTGCCC GGGAGCCCCG AGGCCCTGCC CCTAAGAAGG ATATCTYTRA CCGCTCCCTT 60 

GTCCACACCC TAACCCCCCA GCTGCTCAGG CAGTGGGCAC ATGGCAGGGG CCTCACTGGG 120 

GGCACATAGA GCATTTGGGG GACTGCGAGT GCTCACCTTT GACTTCCTGC AGGTCGGGGG 180 

AAAACCAGAT CATGATGACC AAAGTYTACA TATTCTTGAT CTTCATQGTG CTGATCCTGC 240 

CCTCCCTGGG TCTCACCAGG TATATGCCAC CACYTTCTGY TCTAAATTCA GAATAAGAGT 300

CACATCAGGA GAGCACTGTC CCCAGGANAA TGCAAACGGG TTGGCAGCA 349 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTAGTCGTTG CGGTTGCCGG GATGGCGAAG ATCTCGCCGT TTGAAGTCGT AAAACGCACC 60 
TCGGTACCGG TGCTTGTTGG TTTGGTGATT GTWATCGTTG CTACAGAGCT GATGGTGCCA 120 






GGAACGGCAG CAGCGGTCAC AGGCAAGTAA ATAGTAATGC CXX5AGCAAGT TTCCTCCGGC 180 

TTTATCATGT CACCCACTGT GGTATATGCG TTGTGGTCTG CX:AACTTTGC CGTGAACAAT 240 

TTCAGCAATA ATCAGATGGC GGCTGGCGCA ATATTCAAGA TAACGCCTGG CAGTGGTGCX3 300

GCTGATGGTT CAGTGCCTGC GSCACCGTTT YTGCCGTATG TTGCACACCA GGNTCTTTAA 360 

ACAGTTTTCG SACCXXX?rTT AGCGTCAAGG GTTCAATGCC GGTCGGTAGC TCGTCCTTAG 420 


GTTCACCX3CG AGCATAAGCA TTAAACATCT CATCAATTTG CTTCTGGCTG GCGCTATCAA 480 

TACTTTCCAG CATATGTTTA CGCTGGCGGA AACGGGTTAG CGTTTGCCCC ARCMGWTCAT 540

AGGCAATGGG CTTAATGAGA TAATCAAATA CACCACAACG TACGGCTTCA GACACCGTTT 600

CCATATCGCT GGCTGCAGTG GTAAACACCA CGTCGCCGGG ATAATGCGCC TGCACCAGTT 660 
CATGCAGTAA AT 672

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1908 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGAGOTGATA TTTTTAGAAA CAGTAATTTT ACTTTTAAGG AAATTGGCTA GCTCTTTGAC 60 

TN^XAGAGCTG TAGGAAGCTC AACATTTCTT TGTAGAGAAC GTTGCTTITT TTGGATTGTA 120 

CAGGTATAAA AACATTGCTT TTGTTGAATT GTATAGGTGT AAAAAGGGAA TAACTGTATG 180 

CAGGTTTGAA AAGGAAATGT GCTTTAGGCA TGAGTCATAA GATGCCATTG TACTTGTAGG 240

CATTTTATTT TCCTTTAGAA ATGGACATCA GCTCTTCTCT TCTGACTGGT AACACATAGC 300 

CCCAAAGCAT GAGATTATTT TTCATTGGGT TTTTATTGTT GTTTAGTTTT GGTTTGITAC 360 


GCCAGCCCAG TCTGTCTGCG GAACACTGAC TCTGCTCTCT AATGAGAACA AAGTTAGAAA 420 

TCTGCCGATA ACCTAAAATA ATTTAGAAAT GAATTAAAAA TGTGAAATCG GGTTAAAGTG 480 

ATGATGATAA AATAGCATGC AAGAAACAAG CTCCTTCCAT CAGACTTGGC TACTGTTTTC 540

TTCTGGTACG ATTTGGTTTG GAAGAGCCTC TTGTTTCCTT CTCTTTGGGG TATGTCTTCG 600 

TTTCTTAATA TCTTTGTAAC ATTATTGAGA TATAATTCAC ATACCTTACA ATTCACTTAT 660 


TTTAAGGGTA CAATTTAGTG GTTTTTAGTG TATTCACAAA GTTGTGTAAC CGTGACCACA 720 

GTCAATTTTA GAACATTTCG TTACCCCAAA AAGAAACCCT GTACCCTTGA GCAGTCACCT 780 

CTCATTTTCT CCCAGTGCCC ACCCCATCCC CGAGCCCCKG GAACCACTAA TCTATTTCTC 840



TCTCTGTAGA TTTGCTTATT CTGGrrCATTT CATATAAATG GAATTCTACA ATATTCGGTC 900 

•mTGGGACT ggcttccx:aa ATATGATTTT CTATAOXSGAG TGAGAAAATT CTTCTCATCT 960 

TGAGAACTCr TATTGCTGTG AAAGGGAGTG GTTGGTAAAA TCAATAGATT TCAGGCAAGA 1020 

GGGCCAGATA CCTAACAGGT TTTTCTCCGT GAATCTTATG CTGAGTAGTT TTTCCTCATA 1080 

ACCAAGCATT TATGATATAT TACTACTTAT AATACTGTGG CTAGTCTCTA GAATGGATGT 1140

TGAAATCTTT GCCTCCTCAG TCGGGAAGAG TCCTCCTAAA AATCAGGCTA AAAATCAGGC 1200

CAAAAATCAG GCCAAATGAC TTGGCAAATA ATTGACAAAG TGGTTTTCAC GTGTCTCTAT 1260 


CTTTGCTAGC AGCTTGTATA CCTCAGGCCA GGTGAGCTCC CCAAATTTCT TTTTTCATTT 1320 

ACTCCAGTGA GOTTCTGCTG TCTTTTTCAA GTATGTACCA TAGGACTTAA AGGTGATTTG 1380 

GATGCGTTGT AACACTGCTA AATATGCTAA GTACAGAATT TTATCTACAG TACTGTGAGA 1440

CAGTCAATTA TTGCCTAGGG TAGTTCAAAA ATATGATGTG AGCTAGTTAA GCCTTTGCTT 1500 

GACTGATTTC AGTGATATTC AGAAGTGTGT ACCAATCAAG GCTCTTTAAA ATACGGAAOG 1560 


ACTCACTTAA TAACCAGGGA ACCAGCCAAA TACTGTGCAG CCGCAGAATA TGCATATCAA 1620 

TGAGOTGGAG GTGATTATTC TCTGTAACTC CCTAATGATT GTTTTCTAAG CATTGTGGCT 1680 

TCTCAGTGGC TTGACAGCAT CTTCCTQGTT GTATGTGGCC TGTTTACATG ATGTATTGAA 1740

TAATGTTGTT TGTTGTGAGC ATCAATGCCT GTAACACCAA ACTAAACACG TGTTTTTGGG 1800 

ATATGTTTCC AATCTTTAAA TGACCTTGCC CTGTCCAATA AATAAATGAT TGTCTCACCC 1860 

TGTTAAAAAA AAAAAAAATT AAAAAAACTG <3GNQC3GOGGC CCGGTACN 1908 



(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCTCAAAAAA AAAAANGAAA GGAAAGAGGT CTCTACACAA GCCCGTGATT CTTCATGGCA 60 

AGGGATAACA TCAGAAATGT TTCATTTYCK GCTATTAGTT TCCATTCCTT TCCCCATCCA 120 

GGCATAAAGA GAAACAAAAG ACAATGATGG TATTCTCTGT GTCCTCAGCT TTGGCACTTT 180

TGTTGATGTT GCTAAGGAGC AGTGACCTTG CTAAAAAGAC TGAATAATCC ACCCACTGAA 240 

TAGCTAACCT GGGGAGGAAA TGAAAATTTC CTTTGTGGAT CTCCCCAAAT CCATTGTTGT 300 

CACCAGGCX:C TCCCAGAACC TCCTCAGTTC CTTCACAGTG CAACCCTGTG TACTTGGCCC 360
GCAACCCAAT AGTATTGTGC CTCACTTCAC CTTCCATGGG CAACTGCCCT CCCTTCTGGA 420
CATAAAACCT CATATTTTAA ATNAAGTTGA AATTTGAA 458



(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1153 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GGCACAGAGC CTCCGACCCA GGTGGTCTGG AGCCTGCCGG GAGAGTGGTG GCATCTGAGA 60

GGCTGGTCGT GGACTGTGGT TGGGQGAGGT GGGAGCTGTT TTAACCGTGT GCCCCCTCTC 120 

CTGTGCCGGC GTGGGCATCC CCCGGQGCAG TGGAACCCGG GCGCTCCTCC AGCTTCCGAG 180 


TCCAGCCAGC CTGGGCGCGG GGCGCGCCCC GAGACACCCG AGGAGTCCGT TCCTCCCTGG 240 

TTACGTGGAC TGTGGAGCTG GTCTCTTGTG GCTCAGCGCC GTGCGGAGGT TGAAGCGTAC 300

CTGCGGAGGT CGCACCAGGG CGTGAGGAGG AGGAGGAAGG GCATGAGCCG AGCTTGAGGA 360

ATCCGTGCTC CAAACTCTAC ACTCAAGGAT GCACTGCGCA ACTCTGGTGG CGATGGGCTG 420 

GGGCAGATGT CCTTGGAGTT CTACCAGAAG AAGAAGTCTC GCTGGCCATT CTCAGACGAG 480 


TGCATCCCAT. GGCy^GTX^ . GA^^ GTGCATGTGG_ TAGCCCTCGC CACGGAGCAG . 54^) . 

GAGCGGCAGA TCTGCCGGGA GAAGGTGGGT GAGAAACTCT GCGAGAAGAT CATCAACATC 600 

GTGGAGGTGA TGAATCGGCA TGAGTACTTG CCCAAGATGC CCACACAGTC QGAGGTGGAT 660

AACGTGTTTG ACACAGGCTT GCGGGACGTG CAGCCCTACC TGTACAAGAT CTCCTTCCAG 720 

ATCACTGATG CCCTGGGCAC CTCAGTCACC ACCACCATGC GCAGGCTCAT CAAAGACACC 780 


CTGCCCTCTG AGCGTCGCTG GATCTCTGGG AGCTCCTTGA TQGCTCCCAG ACCTTGGCTT 840 

TTGGGAATTG CACTTTTGGG CCTTTGGGCT CTGGAACCTG CTCTGGGTCA TTGGTGAGAC 900 

TTGGAAGGGG CAGCCCCCGC TGGCTTCTTG GTTTTGTGGT TGCCAGCCTC AGGTCATCCT 960

TTTAATCTTT GCTGACGGTT CAGTCCTGCC TCTACTGTCT CTCCATAGCC CTGGTGGGGT 1020 

CCCCCTTCTT TCTCCACTGT ACAGAAGAGC CACCACTGGG ATGGGGAATA AAGTTGAGAA 1080 


CATGAGTTTG GGCTGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AAA 1153 

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1983 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

GGCACGAGAG GGGCCGAGCC GACAAGATGT TCTTGCTGCC TCTTCCGGCT GCGGGGCGAG 60 

TAGTCGTCCG ACGTCTGGCC GTGAGACGTT TCGGGAGCCG GAGTCTCTCC ACCGCAGACA 120

TGACGAAGGG CCTTGTTTTA GGAATCTATT CCAAAGAAAA AGAAGATGAT GTGCCACAGT 180 

TCACAAGTGC AGGAGAGAAT TTTGATAAAT TGTTAGCTGG AAAGCTGAGA GAGACTTTGA 240 


ACATATCTGG ACCACCTCTG AAGGCAGGGA AGACTCGAAC CTTTTATGGT CTGCATCAGG 300 

ACTTCCCCAG CGTGGTGCTA GTTGGCCTCG GCAAAAAGGC AGCrGGAATC GACGAACAGG 360 

AAAACTGGCA TGAAGGCAAA GAAAACATCA GAGCTGCTGT TGCAGCGGGG TGCAGGCAGA 420

TTCAAGACCT GGAGCTCTCG TCTGTGGARG TGGATCCCTG TGGAGACGCT CAGGCTGCTG 480 

CGGAGGGAGC GGTGCTTGGT CTCTATGAAT ACGATGACCT AAAGCAAAAA AAQAAGATGG 540 


CTGTGTCGGC AAAGCTCTAT GGAAGTGGGG ATCAGGAGGC CTGGCAGAAA GGAGTCCTGT 600 

TTGCTTCTGG GCAGAACTTG GCACGCCAAT TGATGGAGAC GCCAGCCAAT GAGATGACGC 660 

CAACCAGATT TGCCGAAATT ATTGAGAAGA ATCTCAAAAG TGCTAGTAGT AAAACCGAGG 720

TCCATATCAG ACCCAAGTCT TGGATTGAGG AACAGGCAAT GGGATGATTG GTGAGTGTGG 780

CCAAAGGATC TGACGAGCCC CCAGTCTTCT TGGAAATTCA CTACAAAGGC AGCCCCAATG 840 


CAAACGAACC ACCCCTGGTG TTTGTTQGGA AAGGAATTAC CTTTGACAGT GGTGGTATCT 900 

CCATCAAGGC TTCTGCAAAT ATGGACCTCA TGAGGGCTGA CATGGGAGGA GCTGCAACTA 960 

TATGCTCAGC CATCGTGTCT GCTGCAAAGC TTAATTTGCC CATTAATATT ATAGGTCTGG 1020

CCCCTCTTTG TGAAAATATG CCCAGCGGCA AGGCCAACAA GCCGGGGGAT GTTGTTAGAG 1080 

CCAAAAACGG GAAGACCATC CAGGTTGATA ACACTGATGC TGAGGGGAGG CTCATACTGG 1140 


CTGATGCGCT CTGTTACGCA CACACGTTTA ACCCGAAGNT CATCCTCAAT GCCGCCACCT 1200 

TAACAGGTGC CATGGATGTA GCTTTGGGAT CAGGTGCCAC TGGQGTCTTT ACCAATTCAT 1260 

CCTGGCTCTG GAACAAACTC TTCGAGGCGA GCATTGAAAC AGGGGACCGT GTCTGGAGGA 1320

TGCCTCTCTT CGAACATTAT ACAAGACAGG TTGTAGATTG CCAGCTTGCT GATGTTAACA 1380 

ACATTGGAAA ATACAGATCT GCAGGAGCAT GTACAGCTGC AGCATTCCTG AAAGAATTCG 1440 

TAACTCATCC TAAGTGGGCA CATTTAGACA TAGCAGGCGT GATGACCAAC AAAGATGAAG 1500

TTCCCTATCT ACGGAAAGGC ATGACTGGGA GGCCXACAAG GACTCTCATT GAGTTCTTAC 1560

TTCGTTTCAG TCAAGACAAT GCTTAGTTCA GATACTCAAA AATGTCTTCA CTCTGTCTTA 1620

AATTGGACAG TTGAACTTAA AAGGTTTTTG AATAAATGGA TGAAAATCTT TTAACGGAGA 1680

CAAAQGATGG TATTTAAAAA TGTAGAACAC AATGAAATTT GTATGCCTTG ATTTTTTTTT 1740

CATTTCACAC AAAGATTTAT AAAGGTAAAG TTAATATCTT ACTTGATAAG GATTTTTAAG 1800

ATACTCTATA AATGATTAAA ATTTTTAGAA CTTCCTAATC ACTTTTCAGA GTATATGTTT 1860

TTCATTGAGA AGCAAAATTG TAACTCAGAT TTGTGATGCT AGGAACATGA GCAAACTGAA 1920

AATTACTATG CACTTGTCAG AAACAATAAA TGCAACTTGT TGTGCAAAAA AAAAAAAAAA 1980

AAA 1983



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATGATGATGA CTTTGAAGAC GATTTTATTC CTCTTCCTCC AGCTAAGCGC CTTGAGGTTA 60 

ATAGTTGGAA AAGACTCTAT AGATATTGAC ATTTCOTCAA ^GAGAAGAGA. ;^3^TCA^ 120

TTAAGGCTTA ATGCCTAAGC NCTTGGTCTT AACTTGACCT GGGATAACTA CTTTAAAGAA 180 

ATAAAAAATT CCAGTCAATT ATTCCTCAAC TGAAAGTTTA GTGGCAGCAC TTCTATTGTC 240 

CCTTCACTTA TCAGCATACT ATTGTAGAAA GTGTACAGCA TACTGACTCA ATTCTTAAGT 300 

CTGATrTGTG CAAATTTTTA TCGTACTTTT TAAATAGCCT TCTTACGTGC AATTCTGAGT 360 

TAGAGGTAAA GCCCTGTTGT AAAATAAAGG CTCAAGCAAA ATTGTACAGT GATAGCAACT 420 

TTCCACACAG GACGTTGAAA ACAGTAATGT GGCTACACAG TTTTTTTAAC TGTAAGAGCA 480 

TCAGCTGGCT CTTTAATATA TGACTAAACA ATAATTTAAA ACAAATCATA GTAGCAGCAT 540 

ATTAAGGGTT TCTAGTATGC TAATATCACC AGCAATGATC OTrGGCTTTT TGATTTATTT 600 

GCTAGATGTT TCCCCCTTGG AGTTTTGTCA GTTTCACACT GTTTGCTGGC CCAGGTGTAC 660 

TGTTTGTGGC CTTTGTTAAT ATCGCAAACC ATTGGTTGGG AGTCAGATTG GTTTCTTAAA 720 

AAAAAAAAAA AAAACGACAT ACGTGACAGC TCACTTTTCA GTTCATTATA TGTACCGAGG 780 

GTAGCAGTGT GTGGGATGAG GTTCGATACA GNCGTATTTA TTGCTTGTCA TGTAAATTAA 840

AAACCTTGTA TTTAACTCTT TTCAATCCTT TTAGATAAAA TTGTTCTTTG CAAGAATGAT 900

TGGTGCTTAT TTTTTCAAAA ATTTGCTC3TG AACAACGTGA TGACAACAAG CAACATTTAT 960

CTAATGAACT ACAGCTATCT TAATTTQGTT CTTCAAGTTT TCTGKTGCAC TrGTAAAATG 1020

CTACAAGGAA TATTAAAAAA ATCTATTCAC TTTAACTTAT AATAC5TTTAT GAAATAAAAA 1080

CATGAGTCAC AGCTTTTGTT CTGTGGTAAC CTATAAAAAA AGTTTGTCTT TGAGATTCAA 1140

TGTAAAGAAC TGAAAACAAT GTATATOTTG TAAATATTTG TGTGTTGTGA GAAATTTTTG 1200

TCATAAGAAA TTAAAAGAAC TTACCAGGAA GGTTTTTAAG TTAGAAATAT TCCATGCCAA 1260

TAAAATAGGA AATTATAAAT ATATAGTTTT AAGCCTGCAT CAGTGGGAGT CTTGGCTATG 1320

TAGTTATGTA GTTATTATGN AACCACCAAG ATTTTTTTGG CTATTTACCG TAACCAAAGG 1380

GGCCGATTAA NrrGGTTTGAA GNCTTG 1406



(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GGGCCTGAAG GCGGCRCGCC AGTCCCGAGC AGTGCTCGCT CCTGCTCGGG GCGCTGCGGC 60
CCGGGGGGTG GGCATGAGGA GTGAGCTGGA GATGTPCGTG GGGAAGAC^^ - - 120

CGAGGACGTG TATCGCCTCT GGCTCGATGG TTACTCGGTG ACCGACGCGG TGGCCCTGCG 180 

GGTGCGCTCG GGAATCCTGG AGCAGACTGG CGCCACGGCA GCGGTGCTGC AGAGCGACAC 240 

CATGGACCAT TACCGCACCT TCCACATGCT CGAGCGGCTG CTGCATGCGC CGCCCAAGCT 300 

ACTGCACCAG CTCATCTTCC AGATTCCGCC CTCCCGGCAG GCACTACTCA TCGAGAQGTA 360

CTATGCCTTT GATGAGGCCT TTGTTCGGGA GGTGCTGGGC AAGAAGCTGT CCAAAGGCAC 420 

CAAGAAAGAC CTGGATGACA TCAGCACCAA AACAGGCATC ACCCTCAAGA GCTGCCGGAG 480

ACAGTTTGAC AACITTAAAC GGGTCTTCAA GGTQGTAGAG GAAATGCGGG GCTCCCTGGT 540 

GGACAATATT CAGCAACACT TCCTCCTCTC TGACCGGTTG GCCAGGGACT ATGCAGCCAT 600 

CGTCTTCTTT GCTAACAACC GCTTTGAGAC AGGGAAGAAA AAACTGCAGT ATCTGAGCTT 660

CGGTGACTTT GCCTTCTGCG CTGAGCTCAT GATCCAAAAC TGGACCCTTG GACCCGTCGA 720 

CTCACAGATG GATGACATGG ACATGGACTT AGACAGGAAT TTCTCCAGGA CTTGAAGGAG 780 

CTCAAGGTGC TAGTGGCTGA CAAGGACXTTT CTGGACCTGC ACAAGAGCCT GGTCTGCACT 840

GCTCTCCGGG AAAGCTGGGC GTCTTCTCTG AGATGGAAGC CAACTTCAAG AACCTGTCCC 900

GGGGGCTGGT GAACX3TGCCG CCAAGCTGAC CCACAATAAA GATGTCAGAG ACCTGTTTGT 960

GGACCTCGTG GAGAAGTTTG TGGAACCCTG CCGCTCCGAC CACTGGCCAC TCAGCGACGT 1020

GCGGTTCTTC CTGAATCAGT ATTCAGCGTC TGTCCAATCC CTCGATGGCT TCCGACACCA 1080

GGCCCTCTGG GACCGCTACA TGGGCACCCT CCGCGGCTGC CTCCTGCGCC TGTATCATGA 1140

CTGAGGTGCC TCCCAACGTC CGCCCACGCT GACAATAAAG TTGCTCTGAG TTTGGAGACT 1200

GGTCCTCGCT CCGGGGAGCA AGTGGGGGGC GTGCAGATGT GCCTGTGTCT GTCTCTGAGC 1260

ACCTGGTGTC CGTGTACAAG GATGGATGTG TNCNC5TGGCT CCTTGGGAAC TGAGACATAT 1320

CTCAGGGAAT CSGTGTCTGTG CTCAGCCCAT (XACCAGAAG AGTCTGCTCA CAAAAAAAAA 1380

AAAAAAAAAA A 1391



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

GGCACGAGTG GAGATGGCTC CGGCCCT 60 

GGCGGGCCGG CTGCCTACCC TCCAGACTGT CCGCTATGGC TCCAAGGCTG TTACCCGCCA 120 

CCGTCGTGTG ATGCACTTTC AGCGGCAGAA GCTGATGGCT GTGACTGAAT ATATCCCCCC 180

GAAACCAGCC ATCCACCCAT CATGCCTGCC ATCTCCTCCC AGCCCCCCAC AGGAGGAGAT 240 

AGGCCTCATC AGGCTTCTCC GCCGGGAGAT AGCAGCAGTT TTCCAGGACA ACCGAATGAT 300 

AGCCGTCTGC CAGAATGTGG CTCTGAGTGC AGAGGACAAG CTTCTTATTG CGACACCAGC 360 

TGCGGAAACA CAAGATCCTG ATGAAGGTCT TCCCCAACCA GGTCCTGAAA GCCCTTCCTG 420 

GAGGATTCCA AGTACCAAAA TCTGCTGCCC CTTTTTGTGG GGCACAACAT GCTGCTGGTC 480

AGTGAAGAGC CCAAGGTCAA GGAGATGGTA CGGATCTTAA GGGACTGTGC CATTCCTGCC 540 

GCTGCTAGGT GGCTGCATTG ATGACACCAT CCTCAGCAGG CAGGGCTTTA TCAACTACTC 600 

CAAGCTCCCC AGCCTGCCCC TGGTGCAGGG GGAGCTTGTA GGAGGCCTCA CCTGCCTCAC 660 

AGCCCAGACC CACTCCCTGC TCCAGCACCA GCCCCTCCAG CTGACCACCC TGTTGGACCA 720 

GTACATCAGA GAGCAACGCG AGRAAGGATT CTGTCATGTC GGCCAATGGG AAGCCAGATC 780

CTGACACTGT TCCGGACTCG TAGCCAGCCT GTTTAGCCAG CCCTGCGCAT AAATACACTC 840

TGCGTTATTG GCTGTGCTCT CCTCAATGGG ACATGTGGAA GAACTTGGGG TCGQGGAGTC 900

TGTTTGTCAC TTGGTTTTCA CTAGTAATGA TATTGTCAGG TATAGGGCCA CTTGGAGATC 960 

CAGAC3GATTC CMTTCAGAT GTCAGTCACC GGCTTCGTCC TTAGTTTTCC CAACTTOGGA 1020 

CGTGATAGGA GCAAAGTCTC TCCATTCTCC AGGTCCAAGG CAGAGATCCT GAAAAGATAG 1080

GQCTATTGTC CCCTGCCTCC TTGGTCACTG CCTCTTGCTG CACGGGCTCC TGAGCCCACC 1140 

CCCTTGGGGC ACAACCTGCC ACTGCCACAG TAGCTCAACC AAGCAGTTGT GCTGAGAATC 1200 

GCACCTGGTG AGAGCCTGCT GTGTGCCAGG CTTTGTGCTG AGTGCTGTTA CATCTATTAG 1260 

TTCCTTTACT GCTGACCACA TTGTACCCAT TTCACAGAGA AGGAGCAGAG AAATTAAGTC 1320 

GCTTGCTCAA GGTCATGCAG TTAGTAAGTG GCAGAACAGG GACTTGAACC AAGCCCTCTC 1380

CTCTGAAGAC CGCGTCCTGA ATTTCTTCAC TAGAGCTTCC TCATCAGGTT ACCCAGAAGT 1440 

GGGTCCCATC CACCATCCAG GTGTGCTTGG ATGTTAGTTC TCCACCCTCG AGGTCTACGC 1500 

TGTGAAAAGT TTGGGAGCAC TGCTTTATAA TAAAATGAAA TATATTCTAA AAAAAAAAAA 1560 

AAAAAAAAA 1569



(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1924 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GGGCCCCCCC WCGWKTTTTT TTTTTTTTTT TTTAATTAGG ATAATGCCTT TATTAACGAG 60 
AATGAAACGT TCATTCCTCC TTCCACTCCT TCTCGTTGGT TTTCTGGACA CAGCTCACCT 120
GATCCTGCTA GAAACGTTGT CAGTCTGCTT GTGGCTTCCC TCCITGATTG ACTCACGCTC 180 
TGTGATGTCT TGAGAAGTAT CTATCCACTT CATGTGAATG AGCACTCCAA TATCAGCCAA 240 

CATCAATCAT TCTTACCTAA AGAATAATAA GAAAAAGTTA ATATAAAAGA CAAGGGTATA 300 
AAATAAAGGT TTGAAAATGC TAGTCAACTT CAAAATTTAA AGAGTAAAAA TCCAGAGATA 360 
AAGATTGGGG GTAAGTTACA GCATAAAAAA ATAGGAAGAA ACTTCATGGT GGGGGGGAAA 420
TCTAAAATTA TTCTTACATA AAATAAGTAG ACACCTGAAT TAGAATGAAA ACTGTATTTT 480 
CTTTAAAATG TAAAAGCCTG ACTCTCAGTT TCACCAGTCT GAGCACAAGT TTCACTGCAA 540 

CCCAAAATAT ACTATCCCTT ATGTGAAGGT ATGTGACAAC GTTGACCTCA CCAAATGAGT 600 

TTTAACATCA GCTCTTTTTT CATATGAAAG CACATACCCT GCTCXTCCATT CAAGTATGTC 660

TTCCATTGTC AGGCAGGCTG ACCACCTTCA GCAGGAGTCC TCCAAGAGTG CCCAACTCCC 720 

CTTCCCACAG TACACAACGC TGTAGTTGTT GTCCTGCAAT CCTTTGTATT TACCTCATTC 780 

TTTCCCATCT AAGTCCTCAC TGAGTTTTAA AGTTAGGGCT GGAAAAGCTA TGCCTTACTG 840 

GGACAGCAAG GAACCAATTT TTTTCTGAGG GAGAAGACAT TCACCTTCAC TATATGCCTG 900 

GCAGGGCCAC AGTGCACAAA ACAAAGATCA GCCTTCATTC AAGTTCCAGG TTTTTCTTCX: 960 

TCCCTGAATG ATTACTGCAA AGGGTATATG AAGTAAGAGT TCCXrTGTTGC ACATGTACCA 1020 

TCCATAAGGG ATACTATATC GTTTTGCATT CTTCCCXXXIA TTCTCCACAT TGTCCTATCT 1080 

TAAGTCCAAG CCCTTTTCAC TCTCAAAAAA AAAAAAAAAA TATTTTTTTC AGCACTGGTG 1140 

TTCAAAAGCA ACGTTTTTAT GGTTAATGGT TTACCAGCAA CTGTTGAGAT TTCCAGTTGA 1200 

GTCTTAAAAA TTGCCAATCA TTATCTAGCA GCAATGACAG ATGATTAGGA GCAGTCAAAT 1260 

CCTCTGAATT CTTTCCCTAA TAGGCAGCCA TTTGAGAACT GCACTAGCTG ACATCACTAA 1320 

AACATTATCA GCTAAAGCCA AAACCAAATA AAGGCCCAGA CCAACATCCT GGCTCTCTAA 1380 

AACCTGTCCA AAATCATTAA GTGAAAGGCA GTAAATGCAG GACTGTGGAT CATGTCACTG 1440 

CAGCTGACAA TGATTAACAA TAGGAGACAT GCAACCCXXTA TTAAGGTTAA AAGTCCAAAA 1500 

CTAGTCACAC GCATCTCTTT ATTGGGGAAA AGTGAGACTA TTATGCATTC TTGGTAQGTT 1560 

TGCAACCTTG CATGAAGAGC ACCCATTGCA TTTCTTTCAT CTTTCAGAAA GCACCGGTAT 1620 

CTGTTCCAAG GGCCTAACAG TACGAAAATA CATTCTGGCA TCACACCTCT GAACCCAAGA 1680

CTGTTCTCAT TAAAAATAAT TTTGGTTTGT AACAAAATTA TGAAATACAA TGCAAGCACC 1740 

TCGGTATAGC ATTATTACTG AAACCACTTA ATTCCCAGCT TTTTGAGTTT TTTAAAAAAA 1800 

CCCACTGCAC TAAGATTCAC AATTCATTGC TACATACAAA TTAAAGCTAG TAAGAACACA 1860 

CTAACGTCAC AAGTTTCTCA TTCTAAAGTG CAAAAGCCTA ATCATCTGAA AGTGAACAQG 1920 

GTAA 1924 



(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TGGTGTGGGG CCCAGAAAMC AAGGGACCAG IXSAAAACAMC CCCAGAGACT TGTATCCX5CC 60

AGGAAAGCCA TTGCGMfTYC TGAGCCCTTG AAGGGCAAGG AGGGAAACAG TGTTACCAGA 120

GCCCAGTAAG AACTGCTGTC ATGAAGGAGG GGCCACCTTG TAAGAGACAT CATTACTACC 180 

AGAACTGTGG TGCCAAATTG CTGGTGTCTC TCTTTGGAGA AACCAACCAG ATACATCTGC 240 

TGGAC3ACCCA GGTGGGCACA GAGAAGGGTG GAGAGAGAAT CTGGGAAGAG AAATCGAGAA 300

TAAGCAGCAC AGTGTTATTC ATTTCTGTAA ATTCCTATGT AGAAGGCTCA GTCTTAGAAA 360 

TAAAGTTATT CTACTAGTTG CAAGTTAAGT GTTTCTGTTT GTTCTGCTTT CCTGTTAGCA 420 

TAAGTAAACT CCCTTTGGAA CTACACAGGT ATGTCTCTCC TTCAACATCT GTGAA 475 



(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

AAGGGACAGA GACCTGGATT CAGATCTCAT TTTACAATGA AGACCCCAAT GCAGAAAGTC 60 

ATGTCTGAAA TTCTGAGCTT ACTCTTCTGC CTGCTGGGAC CTGCTCTCGA TGAGAGAAGG 120 

GAGGAAAAGG ACTAATCAGA GGAGCCAATG AAGTCACTCC ATGAGTTTCC TGAACCCTGC 180
CCAGCTAGAG ATTAACGTYT GACCWTCAAC GTAGGACACT GTGCAGATGG GTACTTGCTG 240

GCGCACATGA AGACCAAAGC CAGGACCAAG CCCCMASCCT GCTWAACACG GCAGARTCTT 300 

GCCCAGCCMA CYTCTGTGAR AATCTGCTTC CCTCCACAGC TCACCC 346 



(2) INFORMATION FOR SEQ ID NO: 49:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

TAGGTGTCAG CCGCCACCCC CCCCCCATAT GCAGATTTAC TSGGCATGGT AGTQGCCAGC 60 

TTCTAACACA GCTGGTATTT CAAGTCTCCT GGGACCTCAC TCAGGAATGA TACCCCCTCA 120 

GTAGAAGCAG CAGGTGATCT TAACTCCTTT CAAAGAGCAG GCCTGTCTGG GAAGCCATCT 180 
CCTCAGCAGG CACAGCAACC CCTCTGGAAA TGGATCACAA ACTCACTTCT CAGCCAOGCA 240

GGCCAAGCTT CTATTGTAAC AGTAGGCACA GTATAGTCGG ATCATCACAT CAGCTGGGTT 300 

TTTGGTTTAG TCATCTAGAG TCGTCTGGAC TAAAGGTCTT TCAGGTCTCC TTGCCCTGTG 360 

AGTGCGTGAA CCTCCCCACC CGAATTGCCT CAGTTGTCCT GAGCCTCATG TCTCTCCTQG 420 

TGGTGGGCCA GGCCCCTCCA TGGGAAGGGA GCCTGCTGCG GGGCAGGCCA GCTGGGGGTG 480

CTCACCTATG CGCAATGANA GTTATTGAAG GACTGGTTGT TGATGTTGGT GAGCGTATCC 540 

TTCATGGCCA GCGCGAAGTC GGCCAGGTCA GCCAGGTGCT GCCAGCGCTC TCTCTCGGAC 600 

TTGTCTTCCT GTGCCAGGGG ACCGTGGAGA AAGTGTCAGG GGCCGCTCAC TGCAGCAGCC 660 

TGCTCTGCTG CCTTCCCTGG CAGTGTTCTG GGGGTGGATT CCCTACAMCT AGATGTTCAA 720 

GGCCTTACTT TTCCTCCCAC AAAGGAGTCG CAGCCACGCT AGCTCTGACT TGCCACTGTG 780

ACAAAGTTCA CGTAGCAGGT CTAGGCAAAG ACTGGGCAAT TGAGCAGAQG AGACGGACCT 840 

GTGAGTCTGA CCRYGAGSCG GRCCCCTTCA CCTTGGCTGG GCTGGTCCTG GTCCTTAGGT 900 

TTTGTCAGGT TGTCCTTGTT TGGATCCCTC AACTAGGTGA TAAGCACTGG AGGGGGATGA 960 

CCCGCCTTGG ACGTGTTTCT TTAACCTCAT CCATATAATA GGGCCGTGGG ATGGTTGTAG 1020 

AGGTAAAGCA GGATGATGGT GTTTTAAGAC CAGAGCTTGG GACCAGGGCT CCTACACCTA 1080

ATTTTCTCTC CTGGTAGCTG AACAAAGGTC TAAATTAGCT TAACAAAAGA ACAGGCTGCC 1140 

GTCAGCCAGA GTTCTGAAGG CCATGCTTTC AGTTTCCCTT GTTGACAATT GCTCTCCAGT 1200 

TCCTATGAAA GCACAGAGCC TTAGGGGGCC TGGCXrACAGA ACACAACCAT CTTAGGCCTG 1260 

AGCTGTGAAC AGCAGGGGGT TGTGTGTCTG TTCTGTTTCT CTGCTTGCCG AACTTTCrCA 1320 

ATAAACCXTTA TTTCTTATTT ATAAAAAAAA AAAAAAAAAA AAAAAA 1366



(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:
GCAGTAATTC CTGTTAGCCA CTGCATCCAC CAAAACTAGT TTATTTTTCC CCTCAAATTC 60
ATGATTTTTA CGTCTGTTAC AAAGGGAATT TTGCTGATAG CTCTTTGGGT CCCACTGTTC 120 
CATTTTATGC TAATAGATTC CATTCTAGGG CCCAGCCGTC TCTTGACTGA TGGTGTTCCC 180 

TTTAACCCTT GGCATGTATA ATAGAATTTT GGTGAATGAA AGAACCCAAA TAGGCCAGAT 240

AGTCCCCCCA GGCCCTGATA TCCATAAAAG GCTTGGGAAT GCATTATGTA ATTGTCCTTA 300 

GTCTTITTGT TGTTTTAGAA AAAAAAAACA AGATGGGCTC AGATGGATGC CTACX5TAAAA 360 

ATQGTTCCTA GCTGTGTACT CATAACTTTT CTTTGAATTG AGTAGTGAAA GGAAGGAGGA 420 

GGAAAGGAAA TTAAATGTCC TTCTAGTATT CTCTGGACTC AAGTCTGACA TATGAGATAA 480 

TAACCTATAT TGAAATGCCA AGAATTGTAT CTGAAACAAG AGAACAGTTT GACACATTTA 540 

TCATGCCTTC ATATTACATA TTAACTGAAA CCAATTAATA AACATATGAA ATATCCATTG 600 

CACAAGGCAA AGGCACCTAA ACCTTTTGTT TCTTTTTCTA CATAGCAGAA ATTGATTTOT 660

TTTTTATTTT TTTAGGGGAA CCTATATAAT TATGACCXJVG TGATGTCTTT TGGTGACTTA 720 

AGCTTATGAA TTCAGGTTAC AATTGAGTTG ATTCTAGATG GTTACTACCT TGAAAAGGAT 780 

GTTGGTGCCT TATGTGACAC GAGCCAGAGC CTGCTGGGGA ATAAACAAAG CAGGTTTCAT 840 

GCCAACACCA ACTCGTAGCT TTAGTGGGCA GATGGGGAGT GGTTCACAGA CTTCCCAAAA 900 

TGTGGGGGCT TTGGGATTTT CCACACCATC CCACGTGTGT TGTTCATTCT TCCTCTTTTC 960

ACACTCTTGG ATGGATWATT TGRAAATCGT GRAAWYMMCY YYKRAATTTG CX:CAATAGCC 1020 

WTGRGCCACC ATTCTTWATG ACACCATAAC CAAATAGTTC CWTAATGTTG AAATATTAGA 1080 

AACCTGTTAC CAGCXTYKSMA KIWACCCWWA mTXTCCCAT GTTTGTGGAA TTGATATTGA 1140 

AATAGCAGGG CTAAGGAATT ACTGGCAAGT TTTAGCCTGT GGGTAATACC TTAGGGTTAT 1200 

TTAAATATTT GTAATTTTAT TTAAATGTTC ATGAATGTTT GAAAGGAACA AAATTATCAG 1260

GGATOGGTCP TTGGGATGGG TGTTATTTTe AeceTCTrTT CTGTAAGAAA AAAGAACAAT 1320

GTCTTAATGT ATTTTTAAAG TTTTTGGTAT AGTTTCTAAT TCCAATTTTA ATAAAAGTTT 1380 

TWTRTAAAAA AAAAAAAAAA AAAAA 1405 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

CGGATTTTCT AGGACCCCAA AAAAAAAAAA AGGGNAAAAA AAACCCNCAA AACCANCCAA 60 
AACCCCAAAA AAAAAAAAAA TCCACAAAAA CAAAAAAACT ATAAAAAAGA AAGAATTAAA 120 
AACTTTCAGA GAATTACTAT TTACTTTATT AACTTACGGA TTTATTATAT AAATATATAT 180
TCACCTAGCA ACATATCTCT GCCGTCTCTC CTGCTCTCAT AATGAAGACA TAGCCGATTC 240

TCTGCCCGGG CCXCTTGCTG ATGCTCCTCC GGGTCTGCGT CX3GGCGTGGG TCTCTGQGGA 300

CCCTCCAGAG GTGGAGGTGG GCTGATGGCC TGGCTGCCTG GTGGTTGATG GTTTTGCTCC 360 

CCXTTACCTTT TTTTTTTGAG TTTATTCTGA TTGATTTTTT TTCTTGGTTT CTGGATAAAC 420 

CACCCTCTGG GGACAGGATA ATAAAACATG TAATATTTTT AAGAAGGAAA AAAAAAAAAA 480

AAAAAACTNG GGGGGGGCCC CGAA 504 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 777 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

NAAGTATCTT GGCCAGTTTA TTACAGAGGA CGATAAATGA TrCCATGTGGj ATAGGGCATA 60 

ACATACAGAG AATGAGACTA TGCCAGAAAT GGGAGGAGGC ATTTGAAACA ACATGAGTAT 120 

CTCAGGGACA GATGGATTGA TTCTGCTATT GGTAGGCCTG GAAGCAANGG TCAGAAGTAG 180 

CAAAAAATGG ATACCAAAAG CACTATTWGT CACCCAAGCT AAGTGGAATA GCTGGCCCAG 240 

TAGGAGAAAT GCAGGTTTTG CTCTACACTA AGTTCTCCAA CTCTTGATAA GCCTCCAAAA 300

ACAAATGTTA GCX5GAAAAAA 7VCGCAGCTGG TTATGAASAG ATATMCTCA TinXIATOAAA 360

AAATCAATGT CAATGCTGTT AATAGAATCC TTTTATCTTC AGGACAGAGG CAATGCCCTA 420 

AACAAACACC AGCTCAAGAG CCTCTGATGC CAACCTAGAG GGTACCCAAA CACAAACTTA 480 

GCATAGAGGT AAGAATCTCT ATGTCTTTTG GTGGAGGCAA AGCCATTTGG TTGGTACTTC 540 

ACAGGAACAT CTTTCTACCA AGTCTTCATC ATATGGTATG TGCCACGAGT CTCCAGTTGT 600

TTGCACCACT GTGTCATAGC TGAGAATACG CTGAAAGGTT AGTTTTGATC CTCGAAACCT 660 

ATTTACAATT GCCAGCTGAT GTCCCTGCTG CCACTTAAAA AAGGCTTGGG TCTGGCATAG 720 

GCAGAMAGGC CTGTGGTCCC CTCGTGCCGA TTCTOGGCTC GAGGCCAATT NCCTTAT 777 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATGACTACAG TGTTATACCC TCCAATCTTT GCAGGTQGGC ATGGAACACT GCTTGTATCA 60 

CTCTGTGCAC GGTATAAATC CATATATCCA CAAAAACACA CATCCATCCA TCAACATATA 120 

CATGGTTTGG GATGAGCAGG TCAATAGTTT TGAGAGQGAG TTTGTTCCTT TTTTTTTTCT 180

CATTATACTC TTAAATTGTT GTCAGTTATC AAACAAACAA ACAGAAAAAT TGTTTGGAAA 240 

AACCTTGCAT ACGCCTTTTC TATCAAGTGC TTTAAAATAT AGACTAAATA CACACATCCT 300 

GCCAGTTTTT TCTTACAGTG ACAGTATCCT TACCTGCCAT TTAATATTAG CCTCGTATTT 360 

TTCTCACGTA TATTTACCTG TGACTTGTAT TTGTTATTTA AACAGGAAAA AAAACATTCA 420 

AAAAAAGAAA AATTAACTGT AGCGCTTCAT TATACTATTA TATTATTATT ATTATTGTGA 480

CATTTTGGAA TACTGTGGAA GTTTTATCTC TTGCATATAC TTTATACGGA AGTATTACGC 540 

CTTAAAAATA CGAAAATAAA TTTTACAAGG TTCCGGTTTT GGTGGTGGAA AGAGTAAATT 600 

GA 602 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1749 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

AGTCACTGAC TTGGAGCCGC TCGGGGGAAG TCCCGCCCAG ACAGGCGGTG GGTGGGAATG 60 
CCTCACTTCA GTTTGAAGAG GGTCCGGATC CAAAGGGGTT AAAACGAGCG AACCCCGATC 120 
CCCGACCACA CTTCCCGCCT CCCTAAAACG CACACCCCGC TAGCCATGGG CAGCCGCGAC 180
CACCTGrrCA AAGTGCTGGT GGTGGGGGAC GCCGCAGTGG GCAAGACGTC GCTGGTGCAG 240 
GATTATTCCC AGGACAGCTT CAGCAAACAC TACAAGTCCA CGGTGGGAGT GGATTTTGCT 300 

CTGT^AGGTTC TCCAGTGGTC TGACTACGAG ATAGTGCGGC TTCAGCTGTG GGATATTGCA 360 
GGGCAGGAGC GCTTCACCTC TATGACACGA TTGTATTATC GGGATGCCTC TGCCTGTGTT 420 
ATTATGTTTG ACGTTACCAA TGCCACTACC TTCAGCAACA GCCAGAGGTG GAAACAGGAC 480
CTAGACAGCA AGCTCACACT ACCCAATGGA GAGCCGGTGC CCTGCCTGCT CTTGGCCAAC 540 
AAGTGTGATC TGTCCCCTTG GGCAGTGAGC CGGGACCAGA TTGACCGGTT CAGTAAAGAG 600 

AACGGTTTCA CAGGTTGGAC AGAAACATCA GTCAAGGAGA ACAAAAATAT TAATGAGGCT 660

ATGAGAGTCC TCATTGAAAA GATGATGAGA AATTCCACAG AAC3ATATCAT GTCTTTGTCC 720

ACCCAAGGGG ACTACATCAA TCTACAAACC AAGTCCTCCA GCTGGTCXrTG CTGCTAGTAG 780

TGTTTGGCTT ATTTTCCATC CCAGTTCTGG GAGGTCTTTT AAGTCTCTTC CCTTTGGTTG 840

CCCACCTGAC CATTTTATTA AGTACATTTG AATTGTCTCC TGACTACTGT CCAGTAAGGA 900

GGGCCTATTG TCACTTAGAA AAGACACCTG GAACCCATGT GCATTTCTGC ATCTCCTGGA 960

TTAGCCTTTC ACATGTTGCT GRCTCACATT AGTGCCAGTT AGTGCCTTCG GTGTAAGATC 1020

TTCTCATCAG CCCTCAATTT GTGATCCGGA ATTTTGTGAG AAGGATTAGA AATCAGCACC 1080

TGCGTTTTAG AGATCATAAT TCTCACCTAC TTCTGAGCTT ATTTTTCCAT TTGATATTCA 1140

TTGATATCAT GACTTCCAAT TGAGAGGAAA ATGAGATCAA ATGTCATTTC CCAAATTTCT 1200

TGTAGGCCGT TGTTTCAGAT TCTTTCTGTC TTGGAATGTA AACATCTGAT TCTGGAATGC 1260

AGAAGGAGGG GTCTGGGCAT CTGTGGATTT TTGGCTACTA GAAGTGTCCC AGAAGTCACT 1320

GTATTTTTGA AACITCTAAC GTCATAATTA AGTTTCTCTT GTCTTGGCAT CAAGAATAGT 1380

CAAGTTTTTT GGCCGGGCAT GGTGGCTCAT GCCKGTAATC CCAGCACTTG GGGAGGCCAA 1440

GGCAGGCGGA TCACATGAGG CCAGGAATTC GAGACCAACC TGGTCAGCAT GGCAAAACCC 1500

CGTCTCTACT AAAAGTACAA AAATTAGCCA GGCGTGATGG CT^CGTGTCTO TAATCCCAGC 1560

TACTCTGGAG ACTGAGGTGG GAGAATCGCT TGAGACTGGG AGGCAGAGGT TGCAGTGAAC 1620

CGAGATCATG CCACCGCACT TCAGCCTGGG TGACAGAGAA GGACTCCGTC TCAAAAAAAA 1680

AAAAAAAAAA AAAACTCX3AG GGGGGGCCCG GTACCCAAAT CXKrCSTGATA OTGATCGTAW 1740

ACAATCNAA 1749



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1896 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

AAAGAGATGG GCTCTTTATT TTCTCGAAAA ACCAATTTGG AGTTACTCAT TTTTCCATAA 60 

CATTAAATTT CTTACAGTGA ACTACATATT GTCCATAAGT GCTTCATCAG GACTCATCGC 120 

CCTCCTGTCT ACTGGCTCCA AATAGACCAT GTCAGCTTCA CCCCCTGGCT TTGTGTCTAT 180 

GGGTGGCCTG TGGTATATGG AAAAGTAGCA GGGTGGTCAG GGTGGGAGAC ACAAGATGTT 240

TTTATAGTCT AGAGCCTTTA AAAAACCCAG CAGAATGTAA TTCAGTATTT GTTTATTCGC 300

TGTTTTTrGA CAGATTGTTG AAATTAAATG AATTGAAAGG GAAACTCAGA GTACTAGGAC 360

GTTTATTAAA AGGAAAAAAA TGTCTTGCAA TGTGCTGTAA TCACAAGAGG AGAAAATAAC 420 

TTGTTTCCTT GATCTGTCAG AGGTCACAGT AACCTGGGCC GAGCTGTTAT TATTTATTAT 480 

ATAATAGTAG TAGGAAGTTA ATAACTGGTT CTCTGTGTTC CAAGCACAAT ATTACAACTr 540

CITTTGAACC GTAAATATCA GAATGAATCC TCTTCCCAGG GGATTGAACA GAAGCTTAAT 600 

GTTTACAAGT GTTTGAATTT GTGATCTGAA ATAACACAAA ATTAAAAACA TGATTTCTCT 660 

AATTTTCCAA CTAGAGGAAG AGAAACTTGT GGAAAAGTTC TTTTTTTTTC a ' ri '' l ' ri '' rri ' l ' 720 

CTTAAAGAAG GGCAGCCAAG GTAGTAACCT AAAAATAGTG CCCAGGCATA TGAGAGTTGT 780 

CCTACGAGGT TAAAGAACAC ACTGTTCCAC TGTATGGCTT TGGCCCTGAG TGGCCAQGGA 840

GGTCAACTIG ACCCTGCCAT GTTGGTTTGA CTTACTAAGA CACAGGAATC ATTGTTTTCC 900 

TTGACCAGGG TCTCACACCC TGGAGGAATG TTAAGTAAGA GAAAGAACCT CTTTCCTGAA 960 

TATTGACATG TAAAAGACCA AAGTAATTTT TCTG7ACTTC TGCAATTCTG AGAACTCTCC 1020 

AAGGAATTTA CAGTGATTTT AGTGCTTGTC AGCATTTTTC CATGAGGACT TTCATACATT 1080 

TGACTCTTTA GTTCACAGGT TCCCATTGAT TGTGAGCAAG ATATTTATCT CTTTAGCCCT 1140

TGGGGATCCA GCTGAGAGCA ATCTCTTGCA TTTTTTTACC CX3TGTATGTA CAGATATCAT 1200 

TTCTTGTGTA TGCCATGACT TGAAAAAGTT TGGGAAGCTC TTTAGCAATA TCAGCTAAAA 1260 

GGATATGAAA TCACAGGTGA TAGCAGTTGT CATTCAGTAA TTTCCTACAA GCAGCACCCC 1320 

AAAGGAAATA TAGTCCTAAT CTTTACTATC CACTTCTAAA TTTAATGTGA ATTTCATACA 1380 

TGTTATTAGT TGTTTTCTTT ATAATTTTAT AAAAATTATT CATCGGGAGT TTAACTTCCA 1440

CTTCCATGCT ATCGGATGTG TTGGGCTCCA TGCAAGAACT TGGAAGAAAA ACAGGCAGGA 1500 

ATGCATTTGC ATAATGACCC AGATCATCAT TTTCTGCAAC TGAGAATTAT ATTTCATCAT 1560 

TGCTTCTAGA AGTCTGCAAT TCTTTACTTT TCTTTGGTGC ATTATTATCT AGGTGCCATC 1620 

ACTGGATAAT GTGGAGTGAC TAGAGAAGTC AYATATCACT GTAAGGTACA GTTAGGGGTA 1680 

ACACTTTAGA GGTTTATTAT TTTTAAAAAA CTTTTCITGA ACTCCTQGGC CAACATGGGT 1740

GAAACCCXXST CTTCTTACTT AAAAATACCC AAAATTAGGC CAGGGGCGTG GATGGGTCGG 1800 

GTGCCTGTTA ATCTTCAGCT ACTTNGGGGA GGGCTTGAAG CX:AGGGAGGA ACTGCCCTGG 1860 

ANCCCCGGGG NGGGCCAGNA GGTTTGCCAG TTGAGT 1896 

(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1753 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TCTTTTTAAA ATAGACATTT GTGGGGCTCA CACAATATAT GAAATAGTAC CCTCTAAAAA 60 

AGAGAAAAAA AAAATCAGGC GGTCAAACTT AGAGCAACAT TGTCTTATTA AAGCATAGTT 120 

TATTTCACTA GAAAAAATTT AATATCAAGG ACTATTACAT ACTTCATTAC TAGGAAGTTC 180

TTTTTAAAAT GACACTTAAA ACAATCACTG AAAACTTGAT CCACATCACA CCCTGTTTAT 240 

TTTCCTTAAA CATCTTGGAA GCCTAAGCTT CTGAGAATCA TGTGGCAAGT GTGATGGGCA 300 

GTAAAATACC AGAGAAGATG TTTAGTAGCA ATTAAAQGCT GTTTGCACCT TTAAGGACCA 360 

GCTGGGCTGT AGTGATTCCT GGGGCCAGAG TGGCATTATG TTTTTACAAA ATAATGACAT 420 

ATGTCACATG TTTGCATGTT TGTTTGCTTG TTGAATTTTT GAACAGCCAG TTGACCAATC 480

ATAGAAAGTA TTACTTTCTT TCATATGGTT TTTGGTTCAC TGGCTTAAGA GGTTTCTCAG 540 

AATATCTATG GCCACAGCAG CATACCAGTT TCCATCCTAA TAGGAATGAA ATTAATTTTG 600 

TATCTACTGA TAACAGAATC TGGGTCACAT GAAAAAAAAT CATTTTATCC GTCTTTTAAG 660 

TATATGTTTA AAATAATAAT TTATGTGTCT GCATATTGCA GAACAGCTCT GAGAGCAACA 720 

GTITCCCATT AACTCTTTCT GACCAATAGT GCTGGCACCG TTGCTTCCTC TTTGGGAAGA 780

GGAAAGGGTG TGTGAACATG GCTAACAATC TTCAAATACC CAAATTGTGA TAGCATAAAT 840 

AAAGTATTTA TTTTATGCCT CAGTATATTA TTATTTAATT TTTTAGGTAA TGCCTATCTC 900 

TTGGTCTATT AAGGAAAGAA GCAATCAGTA GAGAATTCAG GATAGTTTTG TTTAAATTCT 960 

TGCAGATTAC ATGTTTTTAC AGTGGCCTGC TATTGAGGAA AGGTATTCTT CYATACAACT 1020 

TGTTTTAACC TTTGAGAACA TTGACAGAAA TTATGCAATG GTTTGTTGAG ATACGGACTT 1080

GATGGTGCTG TTTAATCAGT TTGCTTCCAA AGTGGCCTAC TCAAGAGGCC CTAAGACTGG 1140 

TAGAAATTAA AAGGATTTCA AAAACTTTCT ATTCCTTTCT TAAACCTACC AGCAAACTAG 1200 

GATTGTGATA GCAATGAATG GTATGATGAA GAAAGTTTGA CCAAATTTGT TTTTTTGTTG 1260 

TTGTTGTTGT TTTGAATTTG AAATCATTCT TATTCCCTTT AAGAATGTTT ATGTATGAGT 1320 

GTGAAGATGC TAGCGAACCT ATGCTCAGAT ATTCATCGTA AGTCTCCCTT CACCTGTTAC 1380

AGAGTTTCAG ATCGGTCACT GATAGTATGT ATTTCTTTAG TAAGAATGTG TTAAAATTAC 1440 

AATGATCTTT TAAAAAGATG ATGCAGTTCT GTATTTATTG TGCTGTGTCT GGTCCTAAGT 1500 
GGAGCCAATT AAACAAGTTT CATATGTATT TTTCCAGTGT TGAATCTCAC ACACTGTACT 1560

TTGAAAATTT CCTTCCATCC TGAATAACGA ATAGAAGAGG CCATATATAT TGCCTCCTTA 1620 

TCCTTGAGAT TTCACTACCT TTATGTTAAA AGTTGTGTAT AATTGTTAAA ATCTGTGAAA 1680

GAATAAAAAG TGGATTTAAA TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1740 

AAAAAAAAGG GGG 1753 



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GCGGAAGTTA CTGCAGCCGC GGTGTTGTGC TGTGGGGAAG GGAGAAGGAT TTGTAAACCC 60 

CGGAGCGAGG TTCTGCTTAC CCGAGGCCGC TGCTGTGCGG AGACCCCCGG GTGAAGCCAC 120 

CGTCATCATG TCTGACCAGG AGGCAAAACC TTCAACTGAG GACTTGGGGG ATAAGAAGGA 180 

AGGTGAATAT ATTAAACTCA AAGTCATTGG ACAGGATAGC AGTGAGATTC ACTTCAAAGT 240

GAAAATGACA ACACATCTCA AGAAACTCAA AGAATCATAC TGTCAAAGAC AGGGTGTTCC 300 

AATGAATTCA CTCAGGTTTC TCTTTGAGGG TCAGAGAATT GCTGATAATC ATACTCCAAA 360 

AGAACTGGGA ATGGAGGAAG AAGATGTGAT TGAAGTTTAT CAGGAACAAA CGGGGGGTCA 420 

TTCAACAGTT TAGATATTCT TTTTATTOTT TTTCTTTTCC CTCAATCCTT TTTTATTTTT 480 

AAAAATAGTT CTTTTGTAAT GTGGTGTTCA AAACGGAATT GAAAACTGGC ACCCCATCTC 540

TTTGAAACAT CTGGTAATTT GAATTCTAGT GCTCATTATT CATTATTGTT TGTTTTCATT 600 

GTGCTGATTT TTGGTGATCA AGCCTCAGTC CCCTTCATAT TACCCTCTCC TTTTTAAAAA 660 

TTACGTGTGC ACAGAGAGGT CACCTTTTTC AGGACATTGC ATTTTCAGGC TTGTGGTGAT 720 

AAATAAGATC GACCAATGCA AGTGTTCATA ATGACTTTCC AATTGGCCCT GATGTTCTAG 780 

CATGTGATTA CTTCACTCCT GGACTGTGAC TTTCAGTGGG AGATGGAAGT TTTTCAGAGA 840

ACTGAACTGT GGAAAAATGA CCTTTCCTTA ACTTGAAGCT ACTTTTAAAA TTTGAGGGTC 900 

TGGACCAAAA GAAGAGGAAT ATCAGGTTGA AGTCAAGATG ACAGATAAQG TGAGAGTAAT 960 

GACTAACTCC AAAGATOGCT TCACTGAAGA AAAGGCATTT TAAGATTTTT TAAAAATCOT 1020 

GTCAGAAGAT CCCAGAAAAG TTCTAATTTT CATTAGCAAT TAATAAAC3CT ATACATGCAG 1080 

AAATGAATAC AACAGAACAC TGCTCTTTTT GATTTTATTT GTACTTTTTG GCCTGGGATA 1140

TGGGTTTTAA ATGGACATTG TCTGTACXAG CTTCATTAAA ATAAACAATA TTTCTAAAAA 1200

TCAWAAAAAA AAAAAAAAAA 1220



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TCGCGCCTGC AGACACAGCA TCTACTCAGC GTGGGTCACC TCTGTGAACA TCACTGACTG 60 

CAAGCCTCCC TCAATTTCTG GTGCAGCCCA TCAGGGACCC ACAGCGCCTG GGAGGATGGT 120 

GCQGATCTTG GCCAATGGGG AAATCGTGCA GGACGACGAC CCCCGAGTGA GGACCACTAC 180 

CCAGCCACCA AGAGGTAGCA TTCCTCGACA GAGCTTCTTC AATAGGGGCC ATGGTGCTCC 240

CCCAQGGGGT CCTGGCCCCC GCCAGCAGCA GGCAGGTGCC AGGCTGGGTG CTGCTCAGTC 300 

CCCCTTCAAT GACCTCAACC GGCAGCTGGT GAACATGGGC TTTCCGCAGT GGCATCTCGG 360 

CAACCATGCT GTGGAGCCGG TGACCTCCAT CCTGCTCCTC TTCCTGCTCA TGATGCTTGG 420 

TGTTCGTGGC CTCCTCCTGG TTGGCCTTGT CTACCTGGTG TCCCACCTGA GTCAGCGGTG 480 

ACCTCTGAGG GCTGATAGGG GTGGGTTTGT TGAGAGGGAC TTGCTGGGCC TTGGTGTGAG 540

AGC^GCXTVTA TTTCGA ATCTCGTCGT ISCCT"^^ GACX^GGACCA 600 

CAGGTGTGTG TTTCCCCTTT GTGTTAAGCG TGAGGCAGAG GGAGACGTTA GTCCCAGCAT 660 

TTCCCAAAGT GTGGGTGGGT CCGTTGGTTC CCGAGATACT TTTAGGTGGT ATGGGGCCTG 720 

CATTAAGTGG CACAAAATCA GAGCAAGAAA GCGATGCCCT TCCCAATTCT CTCAATCCTT 780 

TTATGCCGAG AAGATCTCAG CTGGATGCCA ACATGTTCCG ATGCCTGTGG AAGACATGCC 840

GACGTCTCCT CTGCCTAGGG AGCAGGACTT GGGCTTAGGG CAGGTGGAAA AAATTCCAGA 900 

CTTTTTTAGC ACTGTTTTTG TTTTAATGGT ATATTTTTAT TGGCTACTTT ATTGTTTAGG 960 

ACAAGTGGTA GTGGCATTCT ATTTATTGTG ACCTTTTCAA TAAATAGATT TAAGTAAAAA 1020 

AAAAAAAAAA AAAACTCGAG GGGGGGCCC 1049 

WO 98/39448

PCT/US98/04493

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1776 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAGAGGATG TCSMAGCTAGA GGTCCCCGAT GGCTGGTCGG ATGGGAAGCA CAAGGCTGAG 60 

GGACTGGATT GTAAAGGCAC TAAGTCGTTC TGCGGTGAGA ATCAGACATG GGGGACCTCT 120 

AGCTTCACAT CCTCTTTCCT TGCAGSTCTG GACATCCTGA GCCCAAGTCC CCCACACTCA 180 

GTGCAGTGAT GAGTGCGGAA GTGAAGGTGA CAGGGCAGAA CCAGGAGCAA TTTCTGCTCC 240 

TAGCCAAGTC GGCCAAGGGG GCAGCGCTGG CCACACTCAT CCATCAGGTG CTQGAGGCCC 300 

CTGGTGTCTA CGTGTTTGGA GAACTGCTGG ACATGCCCAA TGTTAGAGAG CTGGCTGAGA 360 

GTGACTTTGC CTCTACCTTC CGGCTGCTCA CAGTGTTTGC TTATGGGACA TACGCTGACT 420 

ACTTAGCTGA AGCCCGGAAT CTTCCTCCAC TAACAGAGGC TCAGAAGAAT AAGCTTCGAC 480 

ACCTCTCAGT TGTCACCCTG GCTGCTAAAG TAAAGTGTAT CCCATATGCA GTGTTGCTGG 540 

AGGCTCTTGC CCTGCGTAAT GTGCGGCAGC TGGAAGACCT TGTGATTGAG GCTGTGTATG 600 

CTGACGTGCT TCGTGGCTCC CTGGACCAGC GCAACCAGCG GCTCGAGGTT GACTACAGCA 660 

TCGGGCGGGA CATCCAGCGC CAGGACCTCA GTGCCATTGC CCGAACCCTK AANAAAAACC 720 

ATTAAAGTTA CGACGGCAGC AGCAGCCGCA GCCACATCTC AGGACCCTGA GCAACACCTG 780 

ACTGAGCTGA GGGAACCAGC TCCTGGCACC AACCAGCGCC ASCCAGCAAG AAAGCCTCAA 840 

AGGGCAAGGG GCTCCGAGGG ANCGCCAAGA TTTGGTCCAA GTCGAATTGA AAGRACTGTC 900 

GTTTCCTCCC TGGQGATGTG GGGTCCCAGC TGCCTGCCTG CCTCTTAGGA GTCCTCAGAG 960 

AGCCTTCTGT GCCCCTGGCC AGCTGATAAT CCTAGGTTCA TGACCCTTCA CCTCCCCTAA 1020 

CCCCAT^ACAT AGATCACACC TTCTCTAGGG AGGAGKCAAA TGTAGGTCAT GTTTTTGTTG 1080 

GTACTTTCTG TTTTTTGTGA CTTCATGTGT TCCATTGCTC CCCGCTGCCA TGCTCTCTCC 1140 

CTTGTTTCCT TAAGAGCTCA GCATCTGTCC CTGTTCATTA CATGTCATTG AGTAGGTGGG 1200 

TAGCCCTGAT GGGGGTCGCT CTGTCTGGAG CATAACCCAC AGGCGTTriT TCTGCCACCC 1260 

CATCCCTGCA TGCCTGATCC CCAGTTCCTA TACCCTACCC CTGACCTATT GAGCAGCCTC 1320 

TGAAGAGCCA TAGGGCCCCC ACCTTTACTC ACACCCTGAG AATTCTGGGA GCCAGTCTGC 1380

CATGCCAGGA GTCACTGGAC ATGTTCATCC TAGAATCCTG TCACACTACA GTCATTTCTT 1440 

TTCCTCTCTC TGGCCCTTGG GTCCTGGGAA TGCTGCTGCT TCAACCCCAG AGCCTAAGAA 1500 

TGGCAGCCGT TTCTTAACAT GTTGAGAGAT GATTCTTTCT TGGCCCTGGC CATCTCGGGA 1560 

AGCTTGATGG CAATCCTGGA AQGGTTTAAT CTCCTTTTGT GAGTTTGGTG GGGAAGGGAA 1620 
GGGTATATAG ATTGTATTAA AAAAAAAAAG GTATATATGC ATATATCTAT ATATAATATG 1680

ACGCAGAAAT AAATCTATGA GAAATCTATC TACAAAMWAA AAAAAAAAAA AAAAAAAAAA 1740 

AGGAATTCGA irTTCAAGCTT ATCGATACCG TCNACC 1776 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

ACAGATAAAT AAATAAATAA TAAATTAAAT TAAATAAAAA ATCTGAGCTA ATCTGAATAA 60 

ATTGAGAGAT TTCACATGAA AGCCAGGATT TCTGGCTTCC CAGGAACAGT CAGAAGAGCT 120 

AGCTAGCAAC ACTGGTCTGC TTGGCTACCT TCTTTGGAAC AACATGAAAT CTAGCTCCCT 180 

TTTTTTTTTT TTTTTGGCCC ACTTCATCCA TTCACATGAC CTGCCTGGCC TCTGCAGGTA 240 

AGTGAGTATG CAACAAAAAT GTAGCACAGG TTTTGTCGCT GAACTACGTG GTTTCAGGTC 300 

CAGCTCTGCC ACTTGCTAGC ATGACCTCGT GCCGAATTCC NGCACGAAGT TT Trrii n T 360 

TTTTTCAGTG CTCCAGTCCC CCTATTGGAG AATCCTGCCC CCCCCTGGGA CAGAATGTTC 420 

ACCCTGGCCC CGCGANTCCC TGA 443 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2888 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



TTAATGTTGT CAATAACCAC CAGGCCAAAC AGAATTTATA TGACCTGGAT GAAGATGATG 60 

ATGGTATAGC TTCCGTTCCT ACTAAACAGA TGAAGTTTGC AGCCTCAGGC GNCTTTCTCC 120 

ACCACATGGC TGGGCTAAGC AGTTCCAAGC TTTCCATGTC CAAGGCCCTC CCTCTCACCA 180 

AAGTGGTTCA GAATGATGCA TACACAGCTC CTGCTCTCCC TTCCTCTATT CGAACAAAAG 240 

CCTTGACCAA CATGTCCCGG ACACTGGTGA ACAAGGAAGA ACCCCCCAAA GAGCTGCCAG 300 

CTGCTGAGCC TGTTCTCAGC CCATTGGAAG GCACCAAGAT GACTGTGAAT AATCTQCACC 360 

CTCGAGTCAC TGAGGAGGAC ATTGTTGAGC TTTTCTGTGT GTGTGGGGCC CTCAAGCGAG 420 

CTCGACTGGT CCATCCTGGG GTAGCGGAGG TGGTC3TTTGT GAAAAAGGAC GATGCCATCA 480 

CCGCATATAA GAAGTACAAC AACCGGTGTC TGGACGGGCA GCCGATGAAG TCCAACCTTC 540 

ACATGAATGG GAATGTTATC ACCTCAGACC AGCCCATCCT GCTGCGGCTG AGTGACAGCC 600 

CATCAATGAA AAAGGAGAGC GAGCTGCCTC GCAGGGTGAA CTCTGCCTCC TCCTCCAACC 660 

CCCCTGCYGA AGTGGACCCT GACACCATCC TGAAGGCACT CTTCAAGTCC TCAGGGGCCT 720 

CTKTGACCAC GCAGCCCACA GAATTCAAAA TCAAGCTTTG AGCAGGGGAG TGAGGCAGCC 780 

AGAAGTGGGG GCAGAGGAGG GTGGCTCTGT TTCCCCAAGG CAAAGCTTAT GACCAATGGG 840 

CCATCGGACT GGAGACCCCT GATTGTGGGA AGGGTTGCCA GGGATAAAGA GCTTCCTCAC 900 

TGGATGGGAC CCGCCTTTCT GTGTTGTGTT CTGCCCTGTG CTCTTCTCTC TACGTTAACG 960 

TTTCCTGTAG TATGTTTCTT CATCTCATCG CCAAGGTAGG CTTGTGTTTT TCAGTGTGTG 1020 

CCTCCCCGAG CCTCAGCCCC AAGCTGATTT CTTATCTGGA AATGGTACAC TGAATTCTCT 1080 

GGGTGGCTTT CTTGTGGCCC CATGGGATGC AGCGTGGGGG CTGTCTGAAG GACCCTGCTT 1140 

TTTCCAQGGG CCGAGGGGCT GCCTTTCCTT TGTGTGTATT AAGCTTTTCA AACAATQGAG 1200 

GGGATGGAGA GCCCTGGTGT CCTGACGGGA GCCAGGTCGG CCTGAGAGCT GTGCCGCTCC 1260 

TCTGTCTTGT CAGTGGAGGT GCCTGGGTGG GGAGCAGGTC TCAGGCCTCT TGTCCTCTCC 1320 

CCAGTGGCTC CAGGCCTCAC TAGTGGCAAG GGCAGGATGA GGCTGCACCX3 CTGGGAAGAG 1380 

TCTATCTAAG YTCTTGGCTT GGAGTCCCGT GTCGTCTCCR CCCAGAGGAA GTTCTCCAGA 1440 

GTTCACCTTT CCCTTTTCCT TGAGTTGTGC TGAATGCCCC ACCCCAGCTC TCTTTCCCTT 1500 

CTQGGTGTCT TTGCTGGGAG GGGGCTGTGT TGTGAGCCCT CCCGGTTCTC ACCTCGCCTG 1560 

GCACTTAACC ACACCCTGGT TTTGTGTAGC CGCCAGCTCT CTTCTGGTTG GGCCTTTGAA 1620 

AGGCTTCAGCC TCCCATTGTG CAGTGCTTGG GTTTGGAGCT TATTTGAATG GAAGAGGTCA 1680 

GTTTGTTCCT GGCTCTCCAT TTCTGGCCTC AGTTGTCTAC AGGACAGTGG TCAGGGATGC 1740 

CTGGAGGCAT ATATCCAGCT GCCACCAAGG GGCACTGTTT GTTCCCACTT ATGTGAGTGA 1800 

CCCCATCCAT CCATGACCAG AGGATTATTT TCCTGCCTTG GCAGAGGAGG AGGAGTCAAG 1860 

GGAGCAGGGC AGCTCTACCA GGCAAGGTGT TTCCCCAGCA TAGGCGCAGA CAGTTGGGAC 1920 

GAAACTTCAG AGCCCAGGCA GTCCCTGAAT GACCAGGCCA GTGTTGTCAC TGAGTGGTCC 1980 

CCTGCTGGTT GGGAGTGAAG AGAATCCAGG CTGGCAGAGC TGGAGCCAGT TGGGGAGCAC 2040 

GGTTCTGGGA GCTCTGCAAA ATCAGTAGCA AGTGCTGGAA AAGGCACATG CCGAAGATAC 2100 

TCAAGAGCTC CCAAGATTTG CTTGAGGCTA GCCCAGTGAA RAAAACCAGA GACTCATGTT 2160 

TCCAGGGGTC AGTCTGTCAG GCAGGAAGGA CCCAGGATTT GAACCCAGCT TCAGTGTGCA 2220 

GGCTCTGAGG CTGCCCAGGA CGGGAAAGTC CAAGGAAGGG GCCTGGTGGT GCTCCACITC 2280 

CAGTTCrTTA AAGAATGCTG CTTTTTATTC TCCTAACCCT TTCAAGTGGG TGCAGACTTC 2340 

TCGTTAGCAG CTGGAAGACA TTCCTCCXirAC ACTTTTCCCT TCCTQGCCCA T^QAGCATC 2400 

CAGAAGGCAG TAGGACCTGG TTTTTCAGGT ACTGGGAGCC GGGGGCTCAC TGCTTGCACT 2460 

GTGCTTAGGG TAGQGATGGT AAATATCCTC CCTGCATGGC TTTATCCTCC CTCTCATCCC 2520 

AAAGCAGGTA TCTTCTGGTT GTCACAGAGT TTCATTGAGT CXIAGCTGCAG CCACGTGGCC 2580 

ATCTGGAGCT GGTQCTATAG GTGACCATCT GGTACATTGA GGGGACCTGT TTGCCTCCTC 2640 

CACTCTATAA GCAGTCATCT TGGGAGACCG GGAGGAGAAG GTGGTGGGCT AGTCCTGTCT 2700 

CCTCCTCCAC TTCCCATC3CC TCTATGTTAC CCATCTGTGT CTCCTGTGCA GAAGGAGAGG 2760 

AAGGGGCATT AAGAGATGAA GGGTGATTAT GTATTACTTA TCCATTTCTG AATAAACATT 2820 

TGTTATTCCT AAAAAAAAAA AAAAAAAACT CX3AGGGGGGG CCCGGWACCC AWATCGCCSK 2880 

AAAGTGAG 2888 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CACTAGTATA ATTTATAATT ATAACCTATT GTGATTTCTT TTCAAATATT AGGTGTCCTA 60 

GTTGCCTATG AAGGTTTGCC ACTTCATCTT GCACTGTTCC CCAAACTTTG GACTGAGCTA 120 

TGCCAGACTC AGTCTGCTAT GTCAAAAAAC TGCATCAAGC TTTTGTGTGA AGATCCTGTT 180 

TTCGCAGAAT ATATTAAATG TATCCTAATG GATGAAAGAA CTTTTTTAAA CAACAACATT 240 

GTCTACACGT TCATGACACA TTTCCTTCTA AAGGTTCAAA GTCAAGTGTT TTCTGAAGCA 300 

AACTGTGCCA ATTTGATCAG CACTCTTATT ACAAACTTGA TAAGCCAGTA TCAGAACCTA 360 

CAGTCTGATT TCTCCAACCG AGTTGAAATT TCCAAAGCAA GTGCTTCTTT AAATGGGGAC 420 

CTGAGGGCAC TCGCTTTGCT CCTGTCAGTA CACACTCCCA AACAGTTAAA CCCAGCTCTA 480 

ATTCCAACTC TGCAAGAGCT TTTAAGCAAA TGCAGGACTT GTCTGCAACA GAGAAACTCA 540 

CTCCAAGAGC AAGAAGCCAA AGAAAGAAAA ACTAAAGATG ATGAAGGAGC AACTCCCATT 600 

AAAAGGCX3GC GTGTTAGCAG TGATGAGGAG CACACTGTAG ACAGCTGCAT CAGTGACATG 660 

AAAACAGAAA CCAGGGAGGT CCTGACCCCA ACGAGCACTT CTGACAATGA GACCAGAGAC 720 

TCCTCAATTA TTGATCCAGG AACTGAGCAA GATCTTCCTT CCCCTGAAAA TAGTTCTGTT 780 

AAAGAATACC GAATGGAAGT TCCATCTTCG TTTTCAGAAG ACATGTCAAA TATCAGGTCA 840 

CAGCATGCAG AAGAACAGTC CAACAATGGT AGATATGACXJ ATTGTAAAGA ATTTAAAGAC 900 

CTCCACTGTT CCAAGGATTC TACCCTAGCC GAGGAAGAAT CTGAGTTCCC TTCTACTTCT 960 

ATCTCTGCAG TTCTGTCTGA CTTAGCTGAC TTGAGAAGCT GTGATGGCCA AGCTTTCCCC 1020 

TCCCAGGACC CTGAGGTTGC TTTATCTCTC AGTTGTGGCC ATTCCAGAGG ACTCTTTAGT 1080 

CATATGCAGC AACATGACAT TTTAGATACC CTGTGTAGGA CCATTGAATC TACAATCCAT 1140 

GTCGTCACAA GGATATCTGG CAAAGGAAAC CAAGCTGCTT CTTGACATTA GGTGTAGCAT 1200 

GTCTACTTTT AAGTCCCTCA CCCCCAACCC CX:ATGCTGTT TGTATAAGTT TTGCTTATTT 1260 

GTTTTTGTGC TTCAGTTTGT CCAGTGCTCT CTGCTTGAAT GGCAAGATAG ATTTATAGGC 1320 

TTAATTCTTG GTCAGGCAGA ACTCCAGATG AAAAAAACTT GCATCTTCAG TATACTTCCT 1380 

AAAGGGCAAT CAGATAATGG ATATGTTTTA TGTAATTAAG AGTTCACTTT AGTGGCTTTC 1440 

ATTTAATATG GCTGTCTGGG AAGAACAGGG TTGCCTAGCC CTGTACAATG TAATTTAAAC 1500 

TTACAGCATT TTTACTGTGT ATGATATGGT GTCCTCTGTG CCAGTTTTGT ACCTTATAGA 1560 

GGCAGATTGC CTCCX3ATCGC TGTGGTTCTT ATTATCAAAA TTAAGTTTAC TTGTATACGG 1620 

AACAACCACA AGAAATTTGA TTCTGTAAAG AATCCTCTTT AGCTGTGGCC TGGCAGTATA 1680 

TAAATGGTGC TTTATTTAAC AGAATACCTG TGGAGGAAAT AAAGCACACT TGATGTAAAA 1740 

ATAATTGTTT TATTTTTATT GACATGACTG ATTGATTGCT ATTCTGTGCA CTTAATTAAA 1800 

CTGATTGTGA TGACTTWWAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 1851 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3542 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

TCCAATGCTG ATGAGCGTCT TCGCTGGCAG GCCAGCTCCT TGCCTGCTGA TGACCTTTGC 60 
ACAGAAAATG CCATCATGCT GAAACGATTC AATAGGTATC CGCTGATCAT TGACCCCTCT 120 
GGACAGGCCA CAGAATTCAT TATGAATGAA TATAAGGVm: GTAAGATCAC ACGGACCAGC 180 
TTCCTCGATG ACGCXTTTCAG AAAGAACTTA GAGAGTGCAC TGAGATTCGG TAACCCCCTT 240 

CTGGTCCAGG ATGTGGAAAG CTACGATCCA GTTTTGAACC CGGTGCTGAA CCGTGAAGTG 300 

CGGCGAACAG GGGGGAGAGT GCTGATCACT CTCGGGGACC AGGACATAGA CCTGTCGCCA 360 

TCGTTTGTCA TCTTCCTGTC CACCCGGGAT CCAACTGTCG AGTTCCCACC AGATCTCTGT 420 

TCCCGGGTTA CTTTTGTAAA CTTCACAGTT ACCCGTAGCA GTTTACAAAG CCAGTGTCTA 480 

AATGAAGTAC TTAAAGCAGA AAGACCTGAT GTGGACGAGA AACGATCTGA TCTTCTTAAA 540 

CTTCAAGGGG AATTTCAGCT CCGTTTGCGT CAGCTGGAAA AATCTCTACT ACAAGCTCTG 600 

AACGAGGTGA AAGGGCGCAT TTTGGATGAC GACACGATCA TAACCACTCT GGAGAACCTG 660 

AAGAGAGAGG CTGCAGAGGT CACCAGGAAA GTTGAGGAGA CX3GACATTGT CATGCAGGAG 720 

GTGGAGACCG TGTCCCAGCA GTACCTCCCG CTCTCCACCG CCTGCAGCAG CATCTACTTC 780 

ACCATGGAGT CCCTCAAGCA GATACACTTC TTGTACCAGT ACTCCCTCCA GTTTTTCCTG 840 

GACA1TTATC ACAACGTCCT ATACGAGAAC CCGAACCTGA AGGGTGTCAC CGACCACACA 900 

CAGCGCCTGT CCATTATAAC AAAGGACCTC TTCCAGGTGG CGTTTAACCG AGTGGCTCGA 960 

GGCATGCTGC ATCAGGACCA CATTACCTTT GCCATGCTGC TGGCAAGAAT CAAACTGAAG 1020 

GGCACCGTGG GGGAGCCCAC CTACGATGCA GAATTCCAGC ACTTCTTGAG AGGAAATGAG 1080 

ATTGTCCTGA GTGCTGGCTC CACCCCCAGG ATCCAGGGCC TGACTGTGGA GCAGGCGGAG 1140 

GCGGTGGTGA GGCTGAGCTG CCTTCCCGCG TTTAAGGACT TGATTGCAAA GGTTCAGGCA 1200 
GACGMCAAT TTGGCATCTG GCTGGACAGC AGCTCCCCX3G AGCAGACTGT GCCCTACXTTC 1260 

TGGAGTGAAG AAACACCTGC AACACCCATT GGCCAGGCCA TCCTWrCGCCT GCTCCTGATC 1320 

CAGGCTTTCC GGCCCGATCG CCTGTTGGCC ATGGCCCACA TGTTTGTTTC AACAAACCTT 1380 

GQGGAGTCTT TCATGTCCAT CATGGAGCAG CCX3CTCGACC TGACCCACAT TGTGGSCACA 1440 

GAGGTGAAGC CCAACACTCC TGTCTTAATG TGCTCTGrTGC CTGGTTATGA TGCCAGTGGA 1500 

CATGTCGAGG ACCTTGCAGC CGAGCAGAAC ACGCAGATCA CTTCAATTGC AATCGGCTCT 1560 

GCAGAAGGCT TTAACCAAGC AGATAAGGCA ATAAACACCG CTGTAAAGTC. GGGCAGGTGG 1620 

GTGATGCTGA AGAATGTGCA TCTGGCCCCA GGGTQGCTGA TGCAGCTGGA GAAGAAGTTG 1680 

CATTCCCTGC AGCCX3CATGC CTGCTTCCGA CTCTTCCTCA CCATGGAGAT CAACCCCAAG 1740 

GTGCCTGTGA ATCTGCTCCG TGCGGGCCGC ATCTTTGTCT TCGAGCCACC GCCAGQGKTG 1800 

AAGGCCAACA TGCTGAGGAC GTTCAGCAGC ATTCCCX3TCT CACGGATATG CAAGTCTCCC 1860 

AACGAGCGTG CCXTGCTTGTA CTTCCTGCTG GCCTGGTTTC ATGCGATCAT CCAAGAACGC 1920 

TTACGATACX3 CACCACTGGG GTGGTCAAAG AACTTATGAAT TTGQAGAGTC TGACCTGCGG 1980 
TCANYTTGCG ATACGGTGGA CACGTGGCTG GATCACACGG CXAAGGGCAG GCAGAACATC 2040 

TCACCX3GATA AGATCCCGTG GTCTGCACTA AAGACCTTAA TGGCCCAGTC CATTTATGGC 2100 

GGGCX3CGTGG ACAACGAGTT TGACXAGCGT CTGCTCAACA CCTTCCTGGA GCGCCTGTTC 2160 

ACAACCAGGA GTTTCGACAG TGAGTTTAAG CTGGCATGCA AGGTCGACGG ACATAAAGAC 2220 

ATTCAAATGC CAGATGGCAT GCAGGCGAGA GGAGTTTGTG CAGTGGGTGG AGTTCCTCCC 2280 

CXSACACCCAG ACGCXXTTCCT GGCTGGGCCT GCCCAACAAC GCCXSAGAGAG TCCTCCTTAC 2340 

CACACAGGGT GTGGACATGA TCAGTAAAAT GCTGAAGATG CAGATGTTGG AGGATGAGGA 2400 

CGACCTGGCC TACGCAGAGA CTGAGAAGAA GACGAGGACA GACTCCACX5T CXX3ACGGGCG 2460 

CCCTGCCTGG ATCCGGACAC TGCACACCAC CGCGTCCAAC TGGCTGCACC TCATCCCCCA 2520 

GACGCTGAGC CACCTCAAGC GCACCX3TGGA GAATATCAAG GATCCTTTGT TCAGGTTCTT 2580 

TGAGAGAGAA GTGAAGATGG GCGCAAAGCT GCTTCAGGAC GTTCGCXAGG ACCTTGCAGA 2640 

TGTCGTCCAG GTGTGCGAAG GAAAGAAGAA GCAGACCAAC TACTTGCGCA CGCK3ATCAA 2700 

CGAGCTAGTG AAAGGGATCT TGCCTCGGAG CTGGTCCCAC TACACGGTGC CTGCCGGCAT 2760 

GACCGTCATC CAGTGGGTGT CCGACTTCAG CGAGAGGATC AAACAGCTGC AGAACATCTC 2820 

ACTGGCAGCT GCATCTGGTG GCGCCAAGGA GCTAAAGAAC ATCCACGTGT GCCTGGGTQG 2880 

CCTGTTCGTG CCTGAGGCGT ACATCACTGC CACCAGGCAG TATGTGGCCC AGGCCAACAG 2940 

CTGGTCCCTG GAGGAGCTCT GCCTGGAAGT CAACGTCACC ACCTCACAGG GCGCCACCCT 3000 

TGACGCTTGC AGCTTCQGAG TCACX5GGTTT GAAACTTCAA GGGGCCACGT GCAACAACAA 3060 

CAAGCTGTCA CTGTCCAATG CCATCTCAAC CGCCCTTCCC CTGACGCAGC TGCGCTGGGT 3120 

CAAGCAGACA AACACCGAGA AGAAGGCCAG TGTGGTAACC TTACXTrGTCT ACCTGAAdT 3180 

CACCCGTGCA GACCTCATCT TCACCGTGGA CTTCGAAATT GCTACAAAGG AGGATCCTCG 3240 

CAGCTTCTAC GAGCGGGGTG TCGCAGTCTT GTGCACAGAG TAAACTTTTC TAGCTGCCCC 3300 

TTTCTGTAAT AGTGAAAC3TT GGTATTTAAC ATTTATTCAT TTTTAAAATA TTTGGAAGGT 3360 

CTGAGCTTGT GAAAAGAAAG TGGTTGGTCT GAGGTTGGAG GAAGCTGAAT GGAATCTGAC 3420 

GGTTQGGAGT GGTGGAAATT GGAAGGATAC CAGGAGGTAT TTTGGGAAGGC CAATCGCX3TC 3480 

GCTCCTTTGA GGAAATAAAA CACTAAGCAT GAAAAAAAAA AAAAAACTTA CAANCCNCAA 3540 

55 ^ ^^^^ 





(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

AGGTGATTTT AATGATAGGT GTCATATATA GGACGGATAA TCTGTTTACA TTCTGTTCTT 60 

CTCGATGCAC TCACAAGCGG GTAACTAGGT GACAAGAAAA CAAAGATCTT ATTCAAAAGA 120 

GGTCTTACAG CAACCCAACG TCTCATCTTC CCATAGTAAA GATGACGGCG CCTTGAGGTA 180 

AGCTACAGGC AACACCACTT CCGCGTTTCT CTTGCGCCCT GGTCCAAGAT GGCGGATGAA 240 

GCCACGCGAC GTGTTGTGTC TGAGATCCCG GTGCTGAAGA CTAACGCCGG ACCCCGAGAT 300 

CGTGAGTTGT GGGTGCAGCG ACTGAAGGAG GAATATCAGT CCCTTATCCG GTATGTQGAG 360 

AACAACAAGA ATGCTGACAA CGATTGGTTC CGACTGGAGT CCAACAAGGA AGGAACTCGG 420 

TGGTTTGGAA AATGCTGGTA TATCCATGAC CTCCTGAAAT ATGAGTTTGA CATCGAGTTT 480 

GACATTCCTA TCACATATCC TACTACTGCC CCAGAAATTG CAGTTCCTGA GCTGGATGGA 540 

AAGACAGCAA AGATGTACAG GGGTGGCAAA ATATGCCTGA CGGATCATTT CAAACCTTTG 600 

TGGGGCCAGG AATGTGCCCA AATTTGGACT AGCTCATCTC ATGGCTCTGG GGCTGGGTCC 660 

ATGGSTGGCA GTGGAAATCC CTGATCTGAT TCAGAAGGGC GTCATCCAAC ACAAAGAGAA 720 

ATGCAACCAA TGAAGAATCA AGCCACTGAG GCAGGGCAGA GGGACCTTTG ATAGGCTACG 780 

ATACTAWTTT CCTGTGCATC ACACTTAACT CATCTAACTG TTCCCCGGAC ANCCTCCACT 840 



CTAGTTGTTA CTAAGTWTG 'CAGTAGCATT NTGGGGAAGA ACA 883 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GGCACGAGGT GGCCTCTACC CTGGGCTCAT CTGGCTACAC AGGGACTCTA AACGCTTCCA 60 

GATTCCCTGG AAACATGCCA CCCGGCATAG CCCTCAACAA GAAGAGGAAA ATACCATTTT 120 

TAAGGCCTGG GCTGTAGAGA CAGGGAAGTA CCAGGAAGGG GTGGATGACC CTGACCCAGC 180 

TAAATGGAAG GCCCAGCTGC GCTGTGCTCT CAATAAGAGC AGAGAATTCA ACCTGATGTA 240 

TGATGGCACC AAGGAGGTGC CCATGAACCC AGTGAAGATA TATCAAGTGT GTGACATCCC 300 
TCAGCCXXIAG GGCTCGATCA TTAACCCAGG ATCCACAGGG TCTGCTCCCT GGGATGAGAA 360 

GGATAATGAT GTGGATGAAG AAGATGAGGA AGATGAGCTG GATCAGTCGC AGCACCATX3T 420 

TCCCATCCAG GACACCTTCC CCTTCCTGAA CATCAATGGT TCTCCCATGG CGCCAGCCAG 480 

TGTGGGCAAT TGCAGTGTGG GCAACTGCAG CCCGGAGGCA GTGTQGCCCA AAACTCAACC 540 

CCTGGAGATG GAAGTACCCC AGGCACCTAT ACAGCCCTTC TATAGCTCTC CAGAACTGTC 600 

GATCAGCTCT CTCCCAATGA CTGACCTGGA CATCAAGTTT CAGTACCX?IO GGAAGGAGTA 660 

CGGGCAGACC ATGACCGTGA GCAACCCTCA GGGCTGCCGA CTCTTCTATO GGGACCTCGG 720 

TCCCATGCCT GACCAGGAGG AGCTCTTTGG TCCCGTCAGN CTGGAGCAGG TCAAATTCCC 780 

AGGTCCTGAG CATATTACCA ATGAGAAGCA GAAGCTGTTC ACTAGCAAGC TGCTCGACGT 840 

CATGGACAGA GGACTGATCC TGGAGGTCAG CGGTCATGCC ATTTATGCCA TCAGGCTGIG 900 

CCAGTGCAAG GTGTACTGGT CTGGGCCATG TGCCXXATCA CTTGTTCCTC CXAACCTCAT 960 

TGAGAGACAA AAGAAGGTCA AGCTATTTTG TCTGGAAACA TTCCTTAGCG ATCTCATTGC 1020 

CCACCAGAAA GGACAGATAG AGAAGCAGCC ACCGTTTGAG ATCTACTTAT GCTTTGGGGA 1080 

AGAATGGCCA GATGGGAAAC CATTGGAAAG GAAACTCATC TTGGTTCAGG TCATTCCAGT 1140 

AGIGGCTCGG ATGATCTACG AGATGTTTTC TGGTGATTTC ACACGATCCT TTGATACTGG 1200 

CAGTGTCCGC CTGCAGATCT CAACCCCAGA CATCAAQGAT AACATCGTTG CTCAGCIXSAA 1260 

GCAGCTGTAC CGCATCCTTC AAACCCAGGA GAGCTGGCAG CCCATGCAGC CX::ACCCCCAG 1320 

CATGCAACTG CCCCCTGCCC TGCCTCCCCA GTAATTGTGA ATGCCATCTT CTTCCTTCTC 1380 

TTTTTTATAA TATTGTACAT ATGGATTTTT 1TATTGTTTA GATTTAACCA GCTTTTAAAT 1440 

CTCTGTTTTC TGTGACAGTG TTAGAAGTTT GTGATTCTCC AAATATGCCT AGATTTAAAG 1500 

CTGATTTAAT TTATGGAAAA AAAAAAAAAA AAAAAAAAAA A 1541 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

AGAAAATGAA TGTTAGAAGG TGCCTGCCGA GGCGGGACAG AGTGTTTGCT CGCGCTGGAG 60 
AAGGCTCTGC TCAGCCCTGA GAGTCCCTTC CTGCCCCACC GATACTGGCA CTTTAAAAAG 120 

GAAGCTGACC GCACAGTGTC CAGACGAATT GGCCCCCAGA AGATGGGGAG TTCTGTCCTG 180 

CX:CTTCTGTG TCTGCGTGAC CTCACCCAGC CTAGGAGGGA GGTGCATTCA GGGTAGATTT 240 

GCCTCTCATT CAAAGTTCTG GGGCTTTGGG CGGAAAACAG CCAGCTTTGG CGCTGTTGGG 300 

GAGACTCCTC CAGACCAGGA ACXTCCAGAAG GAGACAGAGC CTGCCACATC CTCCCACGCC 360 

AGGCCCTGGG CCAGGGTGAT TGGACTGAGA ATTTGGCCAC AACCAAATTG ATGCTGGCTG 420 

GAACCAGAGG CCAGAAAGCC TGGCCTTGTC CCCATGTGGG AGCCCTGTCC TCAGCCCTCT 480 

TGTCCCCTTG AGCICAGTGA ATTCCCACCA GGTGCCCACA GCTCCTGGAC TTCAAATTCT 540 

ATATATTGAG AGAGTTGGAG AGTATATCAG AGATATTTTT GGAAAGGAGT TGGTCTATGC 600 

AATGTCAGTT TQGAATCTTC TTGAAAGTTT AATGTTTTTA TTAGGAGATT TAAAGAAAAT 660 

AAAGGTCTAC AATATCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 720 

AAAAAAAAAA AA 732 






(2) INFORMATION FOR SEQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTAAGGAATT CGGCMCGATC CCGGCAAGTA ACATGACTAA AAAGAAGCGG GAGAATCTGG 60 

GCGTCGCTCT AGAGATCGAT GGGCTAGAGG AGAAGCTGTC CCAGTGTCGG AGAGACCTGG 120 

AGGCCGTGAA CTCCAGACTC CACAGCCGGG AGCTGAGCCC AGAGGCCAGG AGGTCCCTGG 180 

AGAAGGAGAA AAACAGCCTA ATGAACAAAG CCTCCAACTA CGAGAAGGAA CTGAAGTTTC 240 

TTCGGCAAGA GAACCGGAAG AACATGCTGC TCTCTGTGGC CATCTTTATC CTCCTGACGC 300 

TGGTCTATGC CTACTGGACC ATGTGAGCCT GGCACTTCCC CACAACCAGC ACAGGCTTCC 360 

ACTTGGCCCC TTGGTCAGGA TCAAGCAGGC ACTTCAAGCC TCAATAGGAC CAAGGTGCTG 420 

GGGTGTTCCC CTCCCAACCT AGTGTTCAAG CATGGCTTCC TGGCGGCCCA GGCCTTGCCT 480 

CCCTGGCCTG CTGGGGGGTT CCGGGTCTCC AGAAGGACAT GGTGCTGGTC CCTCCCTTAG 540 

CCCAAGGGAG AGGCAATAAA GAACACAAAG CTGAAAAAAA AAAAAAAAAA AACTCGTAQG 600 

GGGGGCCCGT ACCCAATCGC CCTOTCGTG 629 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CTGCTAGCCG GCCGGCGCAG GCTGCCGAGC GGGTGAGCGC GCAGGCCAGG CCAAAGCCCT 60 

GGTACCCGCG CGGTGCGGGC CTCAGTCTGC GGCCATGGGG GCGTCCQCGC GGCTGCTGCG 120 

AGCGGTGATC ATGGGGGCCC CGGGCTCGGG CAAGGGCACC GTGTCGTCGC GCATCACTAC 180 

ACACTTCGAG CTGAAGCACC TCTCCAGCGG GGACCTGCTC CGGGACAACA TGCTGCGGGG 240 

CACAGAAATT GGCX3TGTTAG CCAAGGCTTT CATTGACCAA GGGAAACTCA TCCCAGATGA 300 

TGTCATGACT CGGCTGGCCC TTCATGAGCT GAAAAATCTC ACCCAGTATA GCTGGCTGTT 360 

GGATGGTTTT CCAAGGACAC TTCCACAGGC AGAAGCCCTA GATAGAGCTT ATCAGATCGA 420 

CACAGTGATT AACCTGAATG TGCCCTTTGA GGTCATTAAA CAACGCCTTA CTGCTCGCTG 480 

GATTCATCCC GCCAGTGGCC GAGTCTATAA CATTGAATTC AACCCTCCCA AAACTGTGGG 540 

CATTGATGAC CTGACTGGGG AGCCTCTCAT TCAGCGTGAG GATGATAAAC CAGAGACGGT 600 

TATCAAGAGA CTAAAGGCTT ATGAAGACCA AACAAAGCCA GTCCTGGAAT ATTACCAGAA 660 

AAAAGGGGTG CTGGAAACAT TCTCCGGAAC AGAAACCAAC AAGATTTQGC CCTATGTATA 720 

TGCTTTCCTA CAAACTAAAG TTCCACAAAG AAGCCAGAAA GCTTCAGTTA CTCCATGAGG 780 

AGAAATGTGT GTAACTATTA ATAGTAAGAT GGGCAAACCT CCTAGTCCTT GCATTTAGAA 840 

GCTGCTTTTC CTAAGACTTC TAGTATGTAT GAATTCTTTG AAAATTATAT TACTTTTATT 900 

TCTACTGATT TTATTTTGGA TACTAAGGAT GTGCCAAATG ATTCGGATAC TAAGATGCAT 960 

CGTTTGAAAT CATCTAGTGT GTTGTATGCA GTTATCCTCA AAAACATCAG CGATGTCTGA 1020 

ACCTTTAAAA CATCTGTTAG AGCAAAATTA AAAGAGCATT TGGTAGTAAT CTAACTTTTT 1080 

GTTCAGTTAA TAAGTGGTTG ATAAAGTTTC CATATTTTTC TGGAAAAGTT AAAAAAAGTT 1140 

ACATGTCATT TGGAGAAAAT ACGTAATCAG AAATTTGTGC ATAGATTGAT GCCAAAAAAG 1200 

ACATTTCCAG CATTGTGGAA CATGGTGAGA CACTATATAA AATTCCAGAA AGAAAGCAAC 1260 

TGGATTTACA GATTTATTGT GAGACACAAA TTCACTGCTG CCTTTACACT AAGAAATGTA 1320 

TATGTTAACC ATATATGCTG TATTTATTTT GTCGTTAAGC ATACTTTCAG TTTACTCAGA 1380 

ATTTTCAATT TGCTATAAAG ATGTATCAAT TAGCATATAG AAAAATATTA CTTTAAGATG 1440 

ACTTGTTTCC TTTGAAAATA CCTGTGTACT GAGQGTTATG ATTTGTGTCA AAAATTGACA 1500 
TAAGTGCTTT TACAAGCACC AAAGTTGAAT GAAITTTCAA CAAAATGTAA TTAAAGTCTA 1560 

TGTTTTCAGT TATGACTCAG GTTAAGAAAT GTGTTTTAGG ATCTACTTGC TGGTrrTTTCT 1620 

TTTTGATCCA AATGTGTGAT CTGCCCTGAT AAATAACAAG TTATNGTACC ATCTCCCCCG 1680 

CCAATAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC CAATTCTCCG 1740 

NAATAGGENAG T 1751 



(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GGCACGAGAT TATGTATTAA AATGTTTTTG AATTGTGAAA TATTAGAATA TTGTTACTAT 60 

TTGACCCAAC TCAAAATCTC CATGGGAAAA TACCTGTCGA TACCCACAGT ATTGTTGAAA 120 

ATAATCAGAT GCAGTATCAC AGCTGTGTCA GACTCTAGTA CCAGTTGGGC AATCAAGGCA 180 

CAGCTAAAAA TTGAAAACAA AGATCTGGAC AACAAAACAG CCAAAGGTGG GGGTCAAGAA 240 

GCTCTGACGl- GTACCTAGCT GTAGAATGCT ATGCACACGT GCCAGGTGTA GTGTGCATAT 300 

CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT CACCTCTGGT TGACCATGAG CTCTGTGTAA 360 

. _ GCAGGAAG^TG, AAQGCTAAGQ .CAGATTTAAG_<^^ 42C)_ 

ATCGTGCAAA GCATTAAGGA AATCTTGTTA CTGCTAAGTG TTGCTGACCC AGGAACAACT 480 

CCTACTCAGC TGGACTTAAA AATAAAAA 508 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
TACATAGAGC AAAGAGAAAT TTCCAGAATT TCTARAATTC TGGAAAGAGA ATTTTCCTGA 60 
GATTGCAGAT TTGCTTGTGT CCTCAGGTGA TGATGAGGGC TGTTTTCCCC TGTTGTCCTT 120 
TCCTCACACT CATGCTTCCT CTCCTAGAGT GTCTGGTTGG CATGATCATG TGCTACCTAG 180 

GCATTTCTTT CACTGATACA AGGAAAACTG CAGGGTTAAA AAAAAAAAAA AAAAAAAAAA 240 

NCNCG 245 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
ATGTTCCTCA TGAGGATGCA CTTGTGCTTC TGCAAGTATT GCTGCAGCTT CATAGTGACT 60 
CCCACCAGCA CCAGCAATAC AGCTAGCTAC CTGTGGCCTT GGATCTCAGC CAGCATGGCT 120 
GGGAGAGGGA GCAGCTGGGC ATGTACCCTA AATGCTGTTA CCAGGGAAGG ACTCCCAGAG 180 
TGAAGACAAG TAGGGACTTC CTGCAGAGGT GGTACATGTG CTCTCTGTAT CCATACTTTT 240 

TTTTmTTT TTTTGAGATA GAGTTTCACC CTTCTTTGCCC TGGCTGGAGT GCAATGGTGC 300 
GATCTCAGCT CACTGCAACC TCTCTGCCTC CCGGGTTCAA GTGATTCTCC TGCCTCAGCC 360 
T 361 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

AGGATCACAC AATAGAGAAC ACTGTAGTAA CATTTCGGTC TGCTCACAAG ACCCAGAACA 60 

TTGATCAGTT TTTGTTGTTG GTTTATTATT TTTCTGTTAA AAAATTGTGA AAAGTTTGTT 120 

TTAGCTAGAT GATATTTTAA TAGCTGCGAG TGCTTTQGAA CTATAAAGAT GTCACTACTT 180 

AACACACATA CCTTATGTTT TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG 240 

TTAGCCAACC ATCTAGCATT TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATQGATGT 300 

TCCCATAACA GCAACAAAAC AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTGT 360 

AAGCACCTGC TTAACTTTGT GTCCCAAATA TTTAGTGTGT ATATATATAT ATATATATAC 420 

ACACACACAC ACATATATAT TCAACAAATA AAGCAAAATA TAACATGCAT TTCACATTTT 480 

GTCTTTCCCT GTTACGATTT TAATAGCAGA ACTGTATGAC AAGTTTAGGT GATCCTAGCA 540 
TATGTTAAAT TCAAATTAAT GTAAAACAGA TTAACAACAA CAAAGAAACT GTCTATTTGA 600 
GTGAAGTCAT GCTTTCTATT ATAATAACTT GGCTTCGGTT ATCCATCAAA TGCACACTTA 660 
TACTGTTATC TGATTGTTTA TAATAAAGAA TACTGTACTT ATAAAAAAAA AAA 713 





(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GAAAGTCAGA GCTGTCCAAT CCCTCAGCAC CTTTTAGATT TGCTCCAAAT TAGAAAOGTG 60 

GGGACTATGT GTTCTGGGCA ATCACAGGTC TGGAAAATGG CTCTGCAGGC TCTTGATAGT 120 

GAGACAGTGG TCATCTrACC AGACATGCAT CTGATTTTAA GCCTCAGGCT AATCCACAAT 180 

GCTCGGCCAT GCCTATGATT AACAAACAAA AGCAAAATCT GCTTTTATAG TTTAGGAAAC 240 

CTGGATAGAA CAGTATTTTT CAGCATTCTT GGATAAAGCA GTTCTGCATT TTTAAATTGG 300 

GACTGCAGAA GTGACTGTCT ATAGTTGTGA AATACAAAAA ATGGTATGIT TGATCAGAAA 360 

AGGAAGCCCG TGCCTGGCAC TTGGAAAGAT ACTGAGCATC ATAACCCTAA TGAGAAAATG 420 

TAGGCrCTGT GAATGTTAAC TACAAATO^G GTTAGGAT^^ CATATGACAC CCTTTGTCAA _ 480 

ACTAAGCTTC ACTAGGAGGA CCTGTGCTCA TAGAAGAATA TGCTTTAAAA GTATCAATTT 540 

TCCACAGTCG ATGATGGAGA AAAGTTCATT TGCACCAGAA TGCTGATAGT CACAATACAC 600 

AGCCTGACAT ATATAACAAT ACAGTTTTCT GTAAACAGAA GTTCTTCCTC TTCCAATTCA 660 

GGAGTCAGTC AGAGCATAAA TATTGCATGT TTCACTTTAG AAACTGATTC ATTTTAGAAA 720 

GCAGATCTGG ATTATTTTGC AGGGTAGAAA TGAAGGCTAT TTCTGGCATT CTTGCTCAAA 780 

AAGTCAATAT ATGTACATTA AGTATAAAAA AGGGTCTCTT TCACCTCTTT TGTTTCGTAG 840 

CATTGGCTAC ATAACTCGTG CC 862 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4602 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 



WO 98/39448 



PCT/US98/04493 



295 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GCGAGGGGGC GKGGGGAGCA GCGCCGARGC CGCCGCCTCC GCCTCCGCCG CCTAGGACTA 60 

GGGGGTGGGG GACGGACAAG CCCCGATGCC GGGGGAKACG GAAGAGCCGA GACCCCCGGA 120 

GCAGCAGGAC CAGGAAGGGG GAGAGGCGGC CAAGGCGGCT CCGGAGGACC CGCAACAACG 180 

GCCCCCTGAG GCGGTCGCGG CGGCGCCTGC AGGGACCACT AGCAGCCGCG TGCTGAGGGG 240 

AGGTCGGGAC CGAGGCCGGG CCGCTGCGRC CGCCGCGCMG CAGCTGTGTC CCGCCGGAGA 300 

AGGCCGAGTA TCCCCGCCGG CGAGGAGCAG CCCCAGCGCC AGGCCTCCCG ACGTCCCCGG 360 

GCAGCAGCCC AGGCCGCGAA GTCCCCGTCT CCAGTTCAGG GCAAGAAGAG TCCGCGACTC 420 

CTATGCATAG AAAAAGTAAC AACTGATAAA GATCCCAAGG AAGAAAAAGA GGAAGAAGAC 480 

GATTCTGCCC TCCCTCAGGA AGTTTCCATT GCTGCATCTA GACCTAGCCG GGGCTGGCGT 540 

AGTAGTAGGA CATCTGTTTC TCGCCATCGT GATACAGAGA ACACCCGAAG CTCTCGGTCC 600 

AAGACCGGTT CATTGCAGCT CATTTGCAAG TCAGAACCAA ATACAGACCA ACTTGATTAT 660 

GATGTTGGAG AAGAGCATCA GTCTCCAGGT GGCATTAGTA GTGAAGAGGA AGAGGAGGAG 720 

GAAGAAGAGA TGTTAATCAG TGAAGAGGAG ATACCATTCA AAGATGATCC AAGAGATGAG 780 

ACCTACAAAC CCCACTTAGA AAGGGAAACC CCAAAGCCAC GGAGAAAATC AGGGAAGGTA 840 

AAAGAAGAGA AGGAGAAGAA GGAAATTAAA GTGGAACTAG AGGTGGAGGT GAAAGAAGAG 900 

GAGAATGAAA TTAGAGAGGA TGAGGAACCT CCAAGGAAGA GAGGAAGAAG ACGAAAAGAT 960 

GACAAAAGTC CACGTTTACC CAAT^GGAGA AAAAAGCCTe CAATCCRGTA TGTCCGTTGT 1020 

GAGATGGAAG GATGTGGAAC TGTCCTTGCC CATCCTCGCT ATTTGCAGCA CCACATTAAA 1080 

TACCAGCATT TGCTGAAGAA GAAATATGTA TGTCCCCATC CCTCCTGTGG ACGACTCTTC 1140 

AGGCTTCAGA AGCAACTTCT GCGACATGCC AAACATCATA CAGATCAAAG GGATTATATC 1200 

TGTGAATATT GTGCTCGGGC CTTCAAGAGT TCCCACAATC TGGCAGTGCA CCGGATGATT 1260 

CACACTGGCG AGAAGCATTA CAATGTGAGA TCTGTGGATT TACTTGTCGA CAAAAGGCAT 1320 

CTCTTAATTG GCACATGAAG AAACATGATG CAGACTCCTT CTACCAGTTT TCTTGCAATA 1380 

TCTGTGGCAA AAAATTTGAG AAGAAGGACA GCGTAGTGGC ACACAAGGCA AAAAGCCACC 1440 

CTGAGGTGCT GATTGCAGAA GCTCTGGCTG CCAATGCAGG CGCCCTCATC ACCAGCACAG 1500 

ATATCTTGGG CACTAACCCA GAGTCCCTGA CGCAGCCTTC AGATGGTCAG GGTCTTCCTC 1560 

TTCTTCCTGA GCCCTTGGGA AACTCAACCT CTGGAGAGTG CCTACTGTTA GAAGCTGAAG 1620 

GGATGTCAAA GTCATACTGC AGTGGGACGG AACGGGTGAG CCTGATGGCT GATGGGAAGA 1680 

TCTTTGTGGG AAGCGGCAGC AGTGGAGGCA CTGAAGGGCT GGTTATGAAC TCAGATATAC 1740 

TCGGTGCTAC CACAGAGGTT CTGATTGAAG ATTCAGACTC TGCCGGACCT TAGTGGACAG 1800 

GAAGACTTGG GGCATGGGAC AGCTCAGACT TTGTATTTAA AAGTTAAAAA GGACAAAAAA 1860 

AAAATCTAAA GCATTTAAAA TCTAGTQAAA TAACTGAAGG GCCTGCTCTT TCCATTGTGG 1920 

ATCACAGCAC ACACATACAT ACACCCTCCA CCTCCCCATC CCCTGTTCTC CCTCTGTTCC 1980 

TCCCCTTATA AAATTGATGT TGTCTTTACC AGAAAGGTAG ACAAAAAAGA AGCAGCAGCA 2040 

GCTCTTAAAG TGAGGGTTAT TCTCATACTC GGTTCCAGCC ATCAGCAGAC TTCCTGCTCA 2100 

TCGGCAGATC CCXTCTTTCCA ACCTGTAACT CTGATGTGCT CTGGATCAGC TTTTAACTTT 2160 

TAATCATATA TTACTGTCTT CTAAATCCCT TCTCXTTCCTC TACTGCTGCC CTATGGTTCT 2220 

GGCTCCTACC CCCTGCGGCA CACTTATCTT CAAATACCAT AGAATTCTAA TCTCTGAAAT 2280 

CATAGCTCTC CAGTGGCTTT TAAAGAAAGG TGGTCCTCAG CACTAACAAA ATCACTACAA 2340 

TAGCCTAGTG CTTTTTTGGA AGCCTTTTTA GGGAAGAATG TTAGGTTCAT GGTAACTAGT 2400 

ATGCTCTTTG AGA1TTTTAC AGTGITGAAA CTTAAGAATT TTGAGAGGGT GAGGAGGGTT 2460 

GTTCAGAATC TAAATTACAG ATAGATGATT GTTTCTTGTG AATTTGTTTC TTOTCCTTrr 2520 

TTTTTGTCCC TACCATTTCC TTACATTTCC CTTGGGGCCC ATCTCTGGCT CCTTGCTTTT 2580 

TGTTTCTTGC TTTGCTTTAT CAGTTCATTC CAGCTCCCTG TTAGTGAAGG ACACTGCTGT 2640 

TAGTGAAGGA ACAAAGTCTA TGAGTCCTAA AATTTTAAGT CAAAGAAAAC TGCTCTGTTT 2700 

CCCCTTTAGT AACACTTCTG AAGAGGAAAA ACTTCAATAG CCAAAGTTAA TAATCCTATA 2760 

TAATAATTGC TTTGGC^TTC ACOTAAAATT CT^COGCATCA CAAITICCIT GGGATaS^ 2820 

TTGTGTTGGG GAATAGATTG CITATTGCTG TTCACTGGAG AGAAAAGGTA GTGTTTTTGT 2880 

ACAAGGTCAT ACCGCCAGAA GCCCCAAATC CTATTTTGGC TCATCTTCAG GTAAAGAGTA 2940 

ATTCCTATCC TCTGTGCCTC AGAAGCTAGA ATCGAAGGCT TACCCTATTC ATTGTTTATT 3000 

GTCAGAAATG CATGATGGCT CTTGGAAAGA ATGACGTTTT GCTGGAAAAA AAAAAAARAA 3060 

CMGTTTGTGT TTCACAAACA TQGCTTATCA ATTTTTTCAA AGAATTCTTT TTTCCCAAAA 3120 

AGAGGAGTAA CAAAATGTCA TTTCTGAAAG AGGCTTACTT TATACCAACT AGTGTCAGCA 3180 

TTTGGGATGC CAGGGAACAG AGAGTGAGAC ACCTACAATC ACCAGTCTCA AATGCGCTAT 3240 

TGTTTCTTTT CAGAGTGTTG CAGATTTGCX: ATTTCTCCAT AATATGGGGA TAGAAAATGG 3300 

AATAAAGATA GAAGGGATGT AGAATATGCT TTCCTGCX::AA CATGGTTTGG AGTCGACTTT 3360 

GGTATATTGA CTAGATTTGA AAATACAAGA TTGATTAGAT GAATCTACAA AAAAGTTGTC 3420 

CTCCTCTCAG GTCCCTTTTA CACTTTTTGA CTAACTAGCA TCTATATTCC ACACTTAGCT 3480 
TTTTTGTCAC 
ATTTTAGCCA 
AATGTATTCT 
ATTAGAAACT 
CCATTTGCCT 
TTATATTTTT 
TCAGCCGTTT 
TTTTAATCTT 
CTCTTCCCAC 
AGATACAGCT 
GTrGTTTTGC 
ATC3GCCGTGG 
AGAGCACTTG 
TCTAACTAAG 
TTTCCCACTT 
TTTTATTTCT 
GGAGAGACAG 
AAAAGGGATA 
AAATAAATAT 



ACTTATCCTT 
CCTACACAAA 
CCTTTCCTAT 
TGCTGTACTT 
ACAATAAATT 
CCTTTTGATA 
CCCTGCTTTC 
AAAGTTCTAC 
TGCATATTTC 
ATTTAATATT 
GTAACTGAGA 
GAGTACTGGG 
AACCTCCnT 
ATTATTATAG 
TCCAATGTAC 
TAAAAGCCTT 
GTAGTTACCT 
CTTTGCTTTT 
GTAAAGAAGA 



TGTCTCCGTA 
AGCAAACTGC 
ACCGCTCTCC 



ATTGCAGCAG 
AAGGGATGCT 
CCTTCTGCTC 
ATTTCATGCT 
CATCTTGAAT 
TCTGGGAGAT 
CTCCTTGATA 
AGTAAAATAA 
GTACCTGTTT 
TCTGGTTGTT 
TCAAGAAAAT 
TTTTGCCTGT 
GAATATAGGT 
CAAATCTTTG 
AAAATAAAAG 



AATTTCATTT GCAGTGGTTA 
ATTTTTAAAA ATCTTTCTGA 
CAACAAAAAA ACAACTAGTT 
TTTAGGGGTC AAGGACCCTC 
TTTGCAATAC TAAAATATTT 
GCATAGTAGA GTTGGTGTAA 
CATATGCCTC ATTGTCCTTC 
CTTAGTCAAA TTCTGTTACC 
TGGTGGTTCT AAATTCTGAA 
GTGCATCCCT CTTCTTTGTG 
TGCTTCAGAG AATTTAGGCA 
AAATATCGAG GTATAGACTA 
GGGGAAAAAG TATAATGAGT 
TGAAATACCA TTTTTTTCTC 
TGAACAAATG TAATGGATCA 
TGTAATGTGC AGGACCCTTC 
TGAAAAGGTT ATGTAAAAAG 
TTTTCTCTTA TTCTAGGTAA 
TTGTCTTCAT^GG 



GTCATCAGAT 
GATOGGAGAA 
AGTTCTACTA 
TTTATAGCTA 
TTTATAGACT 
TTAAACTATC 
CAGGGAQCTC 
TTTTTAATAA 
ACTGTAGTTG 
GTTGCCCAAG 
AACACTGGCC 
GCATCCACAT 
GTACTACCAA 
CTTTTGTGTT 
ATTTAAAATA 
TCCTTTCATG 
AAATTATAAT 
GGCATATTAA 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGCGCCCCGG GCCQGCGGGT TTCTCTAACA AATAAACAGA ACCCGCACTG CCCAGGCGAG 60 
CGTTGCCACT TTCAAAGTGG TCCCCTGGGG GAGCTCAGCC TCATCCTGAT GATGCTGCCA 120 
AGGCGCACTT TTTATTTTTA TTTTATTTTT ATTTTTTTTT TAGCATCCTT TTGGGGCTTC 180 
ACTCTCAGAG CCAGTTTTTA AGGGACACCA GAGCCGCAGC CTGCTCTGAT TCTATGGCTT 240 
GGTTGTTACT ATAAGAGTAA TTGCCTAACT TGATTTTTCA TCTCTTTAAC CAAACTTGTG 300 



3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4602 

GCCAAAAGAT ATTTGACCGT TTCCAAAATT CAGATTCTGC CTCTGCGGAT AAATATTTGC 360 

CACGAATGAG TAACTCCTGT CACCACTCTG AAGGTCCAGA CAGAAGGTTT TGACACATTC 420 

TTAGCACTGA ACTCCTCTGT GATCTAGGAT GATCTGTTCC CCCTCTGGAT GAACATCCTC 480 

TGATGATCAA GGCTCCCAGC AGGCTACTTT GAAGGGAACA ATCAGATGCA AAAGCTCTTG 540 

GGTGTTTATT TAAAATACTA GTGTCACTTT CTGAGTACCC GCCGCTTCAC AGGCTGAGTC 600 

CAGGCCTGTG TGCTTTGTAG AGCCAGCTGC TTGCTCACAG CCACATTTCC ATTTGCATCA 660 

TTACTGCCTT CACCTGCATA GTCACTCTTT TGATGCTGGG GAACCAAAAT C3GTGATGATA 720 

15 

TATAGACTTT ATGTATAGCC ACAGTTCATC CCCAACCCTA GTCTTCGAAA TGTTAATATT 780 

TGATAAATCT AGAAAATGCA TTCATACAAT TACAGAATTC AAATATTGCA AAAGGATGTG 840 

20 TGTOTTTCTC CCCGAGCTCC CCTGTTCCCC TTCATTGAAA ACCACCACGG TGCCATCTCT 900 

TGTGTATGCA GGGCTATGCA CCTGCAGGCA CX3TGTGTATG CACTCCCCGC TTGTX3TTTAC 960 

ACAAGCTGTG GGGTGTTACG CATGCCTGCT TTTTTCACTT AATAATACAG CTTGGAGAGA 1020 

25 

TTTTTGTATC ACATTATAAA TCCCACTCGC TCTTTTTGAT GGCCACATAA TAACTACTGC 1080 

ATAATATGGA TACGCCTTAT TTGATTTAAC TAGTTCCCTA ATGATGGACT TTTAAGTTGT 1140 

30 TIXXTTTTTT TTTCTTTTTT GCTACTGCAA ACGATGCTAT AATAAATGTC CTTATCAAAA 1200 

AAAAAAAAAA AAAAAAAAAA AAAAAANCCC NGGGGGGGGG CCCCGGGAAC NCAAT 1255 




(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

GGCACGAGAG AAATGTTTGA TTCTCTTTCC TATTTTAAGG GATCTTCTCT CTTGTTGATG 60 

TTGAAAACTT ACCTTAGTGA AGATGTGTTT CAACATGCTG TTGTCCTTTA CCTGCATAAT 120 

50 

CACAGCTATG CATCTATTCA AAGTGATGAT CTGTGGGATA GTTTTAATGA GGTCACAAAC 180 

CAAACACTAG ATGTAAAGAG AATGATGAAA ACCTGGACCC TGCAGAAAGG ATTTCCTTTA 240 

55 GTGACTGTTC AAAAGAAAGG AAAGGAACTT TTTATACAAC AAGAGAGATT CTTTTTAAAT 300 

ATGAAGCCTG AAATTCAGCC TTCAGATACA AGGTACATGC CCTCTTTCTT TTCATGCCAT 360 

CTCTTTTGCA CTCTCAGGTG GAAATATTTT GAAGTGTTTT ATAATCATAA GTTCTTGTGA 420 

60 
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AACCTAACAA GATTATCCCT TCCTAAGAAT ACTTAACCTT CCTACCAAAT TAAAA 475 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 465 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

TTCTCTCTGC TdTCGACTG CACCGCACTC GCGCGTGACC CTGACTCCCC CTAGTCAGCT 60 

CAGCGGTGCT GCCATGGCGT GGCGGCGGCG CGAACCRGCG TCGGGGCTCG CGGCGTGTTG 120 

20 GCTCTGGCGT TGCTCGCCCT GGCCCTGTGC GTGCCCGGGG CCCGGGGCCG GGCTCTCXSAG 180 

TGGTTCTCGG CCGTGGTAAA CATCGAGTAC GTGGACCCGC AGACCAACCT GACGGTCTGG 240 

AGCGTCTCGG AGAGTGGCCG CTTCGGCGAC AGCTCGCCCA AQGAGGGCGC GCATGGCCTG 300 

25 

GTC3GGCGTCC CGTGGGCGCC CGGCGGAGAM CTCGARGGCT KCGCGCCCGA CACGCGCTTC 360 

TTCGTGCCCG AGCCCGGCGG CCGAGGGGCC GCGCCCTGGG TCGCCCTGGT GGTCGTGGGG 420 

30 GCTGCACCTT TCAAC5GACAA AGTGCTGGTG GCGGCGCNGA ANGAA 465 



(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1907 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

45 ACATGCAGCC CAACTACAGA TTCTTATGGA ATTCCTCAAG GTTGCAAGAA GAAATAAGAG 60 

AGAGCAACTG GAACAGATCC AGAAGGAGCT AAGTGTTTTG GAAGAGGATA TTAAGAGAGT 120 

GGAAGAAATG AGTGGCTTAT ACTCTCCTGT CAGTGAGGAT AGCACAGTGC CTCAATTTGA 180 

50 

AGCTCCTTCT CCATCACACA GTAGTATTAT TGATTCCACA GAATACAGCC AACCTCCAGG 240 

TTTCAGTGGC AGTTCTCAGA CAAAGAAACA GCCTTGGTAT AATAGCACGT TAGCATCAAG 300 

55 ACGAAAACGA CTTACTGCTC ATTTTGAAGA CTTGGAGCAG TGTTACTTTT CTACAAGGAT 360 

GTCTCGTATC TCAGATGACA GTCGAACTGC AAGCCAGTTG GATGAATTTC AGGAATGCTT 420 

GTCCAAGTTT ACTCGATATA ATTCAGTACG ACCTTTAGCC ACATTGTCAT ATGCTAGTGA 480 

60 
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TCTCTATAAT GGTTCCAGTA TAGTCTCTAG TATTGAATTT GACCGGGATT GTGACTATTT 540 

TGCGATTGCT GGAGTTACAA AGAAGATTAA AGTCTATGAA TATGACACTG TCATCCAGC3A 600 

TGCAGTGGAT ATTCATTACC CTGAGAATGA AATGACCTGC AATTCGAAAA TCAGCTGTAT 660 

CAGTTGGAGT AGTTACCATA AGAACCTGTT AGCTAGCAGT GATTATGAAG GCACTGTTAT 720 

TTTATGGGAT GGATTCACAG GACAGAGGTC AAAGGTCTAT CAGGAGCATG AGAAGAGGTG 780 

TTGGAGTGTT GACTTTAATT TGATGGATCC TAAACTCTTG GCTTCAGGTT CTGATGATGC 840 

AAAAGTGAAG CTGTGGTCTA CCAATCTAGA CAACTCAGTG GCAAGCATTG AGGCAAAGGC 900 

15 TAATGTGTGC TGTGTTAAAT TCAGCCCCTC TTCCAGATAC CATTTGGCTT TCGGCTGTGC 960 

AGATCACTGT GTCCACTACT ATGATCTTCG TAACACTAAA CAGCCAATCA TGGTATTCAA 1020 

AGGACACCGT AAAGCAGTCT CTTATGCAAA GTTTGTGAGT GGTGAGGAAA TrGTCTCTGC 1080 

20 

CTCAACAGAC AGTCAGCTAA AACTGTGGAA TGTAGGGAAA CCATACTGCC TACGTTCCTT 1140 

CAAGGGTCAT ATCAATGAAA AAAACTTTGT AGGCXnXSGCT TCCAATGGAG ATTATATAGC 1200 

25 TTGTGGAAGT GAAAATAACT CTCTCTACCT GTACTATAAA GGACTTTCTA AGACTTTGCT 1260 

AACTTTTAAG TTTGATACAG TCAAAAGTGT TCTCGACAAA GACCGAAAAG AAGATGATAC 1320 

AAATGAATTT GTTAGTGCTG TGTGCTGGAG GGCACTACCA GATGGGGAGT CCAATGTGCT 1380 

30 

GATTGCTGCT AACAGTCAGG GTACAATTAA GGTGCTAGAA TTGGTATGAA GGGTTAACTC 1440 

AAGTCAAATT GTACTTGATC CTGCTGAAAT ACATCTGCAG CTGACAATGA GAGAAGAAAC 1500 

35 AGAAAATGTC ATGTGATGTC TCTCCCCAAA GTCATCATGG GTTTTGGATT TGTTTTGAAT 1560 

AITTTITTCT TTnTirTTCTTT TCCCTCCTTT ATGACCTTTG GGACATTGGG AATACCCAGC 1620

CAACTCTCCA CCATCAATGT AACTCCATGG ACATTGCTGC TCTTCGTGGT GTTATCTAAT 1680 

40 

TTTTGTGATA GGGAAACAAA TTCTTTTGAA TAAAAATAAA TAACAAAACA ATAAAAGTTT 1740 

ATTGAGCXAC AGTTGAGCTT GGAAAGTTTT TGTCAAATGC NGCAAGAGAT AACTCTTTTT 1800 

45 ANGAAGTAGC ATATGTGAAC TATAATGTAA CAGTGAATAA TTTGTAAAGT TCGTATTTCC 1860 

CAACCTCTTT GQGAATTACA CATATCAATA TAAACAAAAT ATAAAGT 1907 






(2) INFORMATION FOR SEQ ID NO: 79:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1168 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:
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GCTGGGGTGT CCCCKCSGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA 60 

CCCATGACTG ATGCTTATAA ATTTGTCAAA GGCAAACGAC CAATTATCTC CCXIAAACCTT 120 

5 

AACTTCATGG GGCAGTTGCT AGAGTTCGAG GAAGACCTAA ACAACGGTGT GACACCGAGA 180 

ATCCTTACAC CAAAGCTGAT GGGCGTGGAG ACGGTTGTGT GACAATGGTC TGGATGGAAA 240 

10 GGATTGCTGC TCTCCATTAG GAGACAATGA GGAAGGAGGA TGGATTCTGG TTTTTTITCT 300 

TTCTTTTTTT TTTTGTAGTT GGGAGTAAGT TTGTGAATGG AAACAAACTT GTTTAAACAC 360 

TTTATTTTTA ACAAGTGTAA GAAGACTATA ACTTTTGATG CCATTGAGAT TCACCTCCCA 420 

15 

CAAACTGACA AAOTAAGGAG GTTAAAGAAG TAATTTnTT AAGCCAACAA TAAAAATATA 480 

ATACAACTTG TTTCTCCCCC TTTTCCTTTT AAGCTATTTG TAGAGTTTAT GACTAAATAG 540 

20 TCTGTGCAGG TTCATAGACC GAAGATACTA CACACTTTAA ACCAATTAAA AAGAACCAAA 600 

AGTAAATAGA AAAGACATTG AATCACCAAG GCCTGGGATC AACCTGGGCT GTCCACACAG 660 

AAAACAAAAA CCCAACCAAA CCAAGCCCTG TTGTGCTCAC TGGTGCAAAG AGAAGATCAG 720 

25 

GGCAGCTTAA GTGGTCTAAG RATCCTTCAG GCATTCTTTA AGGAGAAAAA GGATACCTTT 780 

GATTTTGTGT GTTTCATGCT CTGGATTTTT TTTTTTTTTC CTTCTCTGQG TTTAAGAGAT 840 

30 TTTTTTTGAA ATAGTGAGGA ACTGACCATT ATATGCCTTC ACTGGCTTCT TGTGCAATAA 900 

TATGATGTTT TAAGTGTGCA AACAAGTTAG AGCTGGCAGC TGAATGATAG ACAAATAGTG 960 

CAAATTTGCC AGCTTGGAGA TAGAAAGGAA TTCAACAATA TATCAAATAC TTTCCTTCCC 1020 

35 

ACCTTrrrCC TTTTTTTTTT TTTTTTCTGA TTTGATTCTG GTTACAGTGC CATAAACCTT 1080 

GTTACATATG TATATCAGAA TGTAAGAAAA AAAAATTTAT TTAAAAATAT TTTTCGCAAA 1140 

40 AAAAAAANNA AAAAACTCGA GGGGGGCC 1168 

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
55 AGAAAATCAC ATCCTAACAA AGAAGTCTGT CTAAGACAGT ACATCTCCTG TTGAACTTGC 60 
ATCTTTCCAC AGGACTTTCT GTTTTTAGGG ATGAGACTAT TCTCTGCTTC ATCAAGGAAA 120 
GAGAAATGTT CAGGGTTGTA GGGATGGCAC ACTTATTAGT TCTGCCTGTC TGAAAGGTTC 180 

60 
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CTGCAGGACA GTTTGGTCAG AGCTGCAATT CTTAGTCCAT GGTCTAATGC TTGAGTATCT 240 

CTTCTTTCCC TTTCCTGTCT CAGGAATCAG CTGAGAATTC ATTCGATTGT CATGCCTCTA 300 

GCCCCTTACT GTGATTTGTT GGTTGCACTT TCATTTGCTT TAGTTCTAGA ATCACCTGTT 360 

GACTCCTCAG ACTTCACCTA ACTTTGGAAA CTCTCTTTTG GAGGCTTCTC ATTTCCCXXTT 420 

AATTCTGTGC TGCCTGAGCC CTAGAATTTT CCCACCAACG AATTATTCCA GGTAGATCCT 480 

AAGTTGCTGG ATCTAGTTGA TATTTAAACA ATATCTAGTT GATATTTCTC ATTCAGTTGG 540 

ATCCAGAAAC CAGTATCTCT NAAAAACAAC CTCTCATACC TTGTGGACCT AATTTTGTGT 600 

GCGTGTGTGT GTGCGCGCAT ATGTATATAG ACAGGCACAT CTTTTTTACT TTTGTAAAAG 660

CTTATGCCTC TTTGGTATCT ATATCTGTGA AAGTTTTAAT GATCTGCCAT AATOTCTTGG 720 

GGACCTITGT CTTCTGTGTA AATGGTACTA GAGAAAACAC CTATATTATG AGTCAATCTA 780 

20 

GTTGGTTTTA TTCGACATGA AGGAAATTTC CAGATAACAA CACTAAC7VAA CTCTCCXTTTG 840 

ACTAGGGGGA CAAAGAAAAG CAAAACTGAC CATAAAAAAC AATTACCTGG TGAGAAGTTG 900 

25 CATAAACAGA ATTAGGTAGT ATATTGAAGA CAGCATCATT AAACAGTTAT GTTGrrTCTCC 960 

TTGCAAAAAA CATGTACTGA CTTCCCGTTG AGTAATGCCA AGTTGTTTTT TTTATTATAA 1020 

AACTTGCCCT TCATTACATG TTTCAAAGTG GTGTGGTGGG CCAAAATATT GAAATGATGG 1080 

30 

AACTGACTGA TAAAGCTGTA CAAATAAGCA GTGTGCCTAA CAAGCAACAC AGTAATGTTG 1140 

ACATGCTTAA TTCACAAATG CTAATTTCAT TATAAATTGT TTTGCTAAAA TACACTTTGA 1200 

35 AACTArmT CTGTATTCCA AGAGCTGAGA TCTTAGATTT TATGTAGTAT TAAGTGAAAA 1260 

AATACGAAAA TAATAAACAT TGAAG 1285 






(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1290 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

TCTCCAGCCC CAATTTCTAC GCGCACCGGA AGACGGAGGT CCTCTTTCCT TGCCTAACGC 60 

AGCCATGGCT CGTGGTCCCA AGAAGCATCT GAAGCGGGTG GCAGCTCCAA AGCATTGGAT 120 

55 

GCTGGATAAA TTGACCGGTG TGTTTGCTCC TCGTCCATCC ACCGGTCCCC ACAAGTTGAG 180 

AGAGTGTCTC CCCCTCATCA TTTTCCTGAG GAACAGACTT AAGTATGCCC TGACAGGAGA 240 

60 TGAAGTAAAG AAGATTTGCA TGCAGCGGTT CATTAAAATC GATGGCAAGG TCCGAACTGA 300 
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TATAACCTAC CCTGCTGGAT TCATGGATGT CATCAGCATT GACAAGACGG GAGAGAATTT 360 

CCGTCTGATC TATGACACCA AGGGTCGCTT TGCTGTACAT CGTATTACAC CTGAGGAGGC 420 

CAAGTACAAG TTGTGCAAAG TGAGAAAGAT CTTTGTGGGC ACAAAAGGAA TCCCTCATCT 480 

GGTGACTCAT GATGCCCGCA CCATCCGCTA CCCCGATCCC CTCATCAAGG TGAATGATAC 540 

10 CATTCAGATT GATTTAGAGA CTGGCAAGAT TACTGATTTC ATCAAGTTCC ATTCACCCAG 600 

CCAGGTGGTC TCGTCACCTC AGAGGCTCCG CAGACTCCTG CCCAGGCCAG GACTGAGGCA 660 

AGCCTCAAGG CACTTCTAGG ACCTGCCTCT TCTCACCAAG ATGAACTCAC TGCTTTCTTG 720 

15 

GCAGCTACTG CTTTTCCTCT GTGCCACCCA CTTTGGGGAG CCATTAGAAA AGGTGGCCTC 780 

TGTGGGGAAT TCTAGACCCA CAGGCCAGCA GCTAGAATCC CTGGGCCTCC TGGCXTCCSGG 840 

20 GGAGCAGAGC CTGCCGTGCA CCGAGAGGAA GCCAGCTGCT ACTGCCAGGC TGAGCCGTCG 900 

GGGGACCTCG CTGTCCCCGC CCCCCGAGAG CTCCGGGAGC CCCCAGCAGC CGGGCCTGTC 960 

CGCCCCCCAC AGCCGCCAGA TCCCCGCACC CXAGGGCGCG GTGCTGGTGC AGCGGGAGAA 1020 

25 

GGACCTGCCG AACTACAACT GGAACTCCTT CXX3CCTGCGC TTCGGCAAGC GGGAGGCGGC 1080 

ACCAGGGAAC CACGGCAGAA GCGCTGGGCG GGGCTGAGGG CGCAGGTGCG GGGCAGTGAA 1140 

30 CTTCAGACCC CAAAGGAGTC AGAGCATGCG GGGCGGGGGC GGGGGGCGGG GACGTAGGGC 1200 

TAAGGGAGGG GGCGCTGGAG CTTCCAACCC GAGGCAATAA AAGAAATGTT GCGTAACTCA 1260 

AAAAAAAAAA AAAAAAAANC TCGGGGGGGG 1290 




(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TTTATTGTAT TCTGTAACTA TAGAACTTCT ATTTWATTCT TTTTTGGACT TGCTAAGTTG 60 

TCTTTWATGG TTTTWAGTTC CATGCTGAAG TTTTCAGTAT TGACTTATCC CCTTGAACAT 120 

GAGTTGTTTT ATAGACTCTR ATGATTCAAA AATCTTACAT CTTTTGGTAG TCTCTTTCAT 180 

55 TTGTYCACTG TTTCTGTTGA TTCTWACTCA TGGTATTTTA ATTCTTCGTT WrTTTTTTTC 240 

TGTTWAGAWA CATTCTTTGA AAAATAATTT GGAGGAATAT TTGATTCTTA TGAACAAGGC 300 

ATTACTCACC AGAGAAGATT TTTTTGTTyT ACCARGTGCC TARGAATGCT AACAGTCTGG 360 

60 
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GAMCACATAG AMCACCAGGT GATGAGACAA TCCTGGGART CCTGTTTTAC TTTGGSCCAT 420 

CTTTTCTCCC AACCCTGTGG GAATARTCAT YCATATCCTA RCTGCAGGCT ARAAGGTGGT 480 

_5 TTATCAGAGC CCAACTTCGA GGGCTCTGGG CTTTAGCTAC TGTCACCCCA TCATAACTGA 540 

GCTTCATGGA TTGATTCTCT TTTTATCTTT CAGATTITCT TTTAAAAATC a ' i ' i ' b - rAUU '' I ' l ' 600 

TTTTTCrrCC GAAAGATTCC CCCAACATTA CCATTCCCXIA CCTTCCGTTG AATTTTTTTG 660 

10 

GCTCTCATTT TGAATTTTTC AAGA 684 




(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2024 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

25 

CTGCAGGAAT TCGGCACAGC TGCGCTGGAG GCTTCATCTT TGCCGCCGCT GCCGTCGCCT 60 

TCCTGGGATT GGAGTCTCGA GCTTTCTTCG TTCGTTCGYC GGCGGGTTCG CGCCCTTCTC 120 

30 GCGCCTCGGG GCTGCGAGGC TGGGGAAGGG GTTGGAGGGG GCTGTTGATC GCCGCGTTTA 180 

AGTTGCGCTC GGGGCGGCCA TGTCGGCCGG CGAGGTCGAG CGCCTAGTGT CGGAGCTGAG 240 

CGGCGGGACC GGAGGGGATG AGGAGGAAGA GTGGCTCTAT GGCGATGAAA ATGAAGTTGA 300 

35 

AAGGCCAGAA GAAGAAAATG CCAGTGCTAA TCCTCCATCT GGAATTGAAG ATGAAACTGC 360

TGAAAATGGT GTACCAAAAC CGAAAGTGAC TGAGACCGAA GATGATAGTG ATAGTGACAG 420 

40 CGATGATGAT GAAGATGATG TTCATGTCAC TATAGGAGAC ATTAAAACGG GAGCACCACA 480 

GTATGGGAGT TATGGTACAG CACCTGTAAA TCTTAACATC AAGACAGGGG GAAGAGTTTA 540 

TGGAACTACA GGGACAAAAG TCAAAGGAGT AGACCTTGAT GCACCTGGAA GCATTAATGG 600 

45 

AGTTCCACTC TTAGAGGTAG ATTTGGATTC TTTTGAAGAT AAACCATGGC GTAAACCTGG 660 

TGCTGATCTT TCTGATTATT TTAATTATGG GTTTAATGAA GATACCTGGA AAGCTTACTG 720 

50 TGAAAAACAA AAGAGGATAC GAATGGGACT TGAAGTTATA CCAGTAACCT CTACTACAAA 780 

TAAAATTACG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 840 

TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 900 

55 

GAGATTACCT GGGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 960 

CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAGGTCCTT TCTGAAAGAT CTGCTACTGA 1020 

60 AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACTCA 1080 
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CerrCCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 1140 

TCCACCACCG GGTTTTCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 1200 

-5 

AGAAAGTGGA CATTCCTCTG GTTATGATAG TCGTTCTGCA CGTGCATTTC CATATGGCAA 1260 

TGTTGCCTTT CCCCATCTTC CTGGTTCTGC TCCTTCGTGG CCTAGTCTTG TGGACACCAG 1320 

10 CAAGCAGTGG GACTATTATG CCAGAAGAGA GAAAGACCGA GATAGAGAGA GAGACAGAGA 1380 

CAGAGAGCGA GACCGTGATC GGGACAGAGA AAGAGAACGC ACCAGAGAGA GAGAGAGGGA 1440 

GCX3TGATCAC AGTCCTACAC CAAGTGTTTT CAACAGCGAT GAAGAACX3AT ACAGATACAG 1500 

15 

GGAATATGCA GAAAGAGGTT ATGAGCGTCA CAGAGCAAGT CGAGAAAAAG AAGAACGACA 1560 

TAGAGAAAGA CGACACAGGG AGAAAGAGGA AACCAGACAT AAGTCTTCTC GAAGTAATAG 1620 

20 TAGACGTCGC CATGAAAGTG AAGAAGGAGA TAGTCACAGG AGACACAAAC ACAAAAAATC 1680 

TAAAAGAAGC AAAGAAGGAA AAGAAGCGGG CAGTGAGCCT GCCCCTGAAC AGGAGAGCAC 1740 

CGAAGCTACA CCTGCAGAAT AGGCATGGTT TTGGCCTTTT GTGTATATTA GTACCAGAAG 1800 

25 

TAGATACTAT AAATCTTGTT ATTTTTCTGG ATAATGTTTA AGAAATTTAC CTTAAATCTT 1860 

GTTCTGTTTG TTAGTATGAA AAGTTAACTT TTTTTCCAAA ATAAAAGAGT GAATTTTTCA 1920 

30 TGTTAAGTTA AAAATCTTTG TCTTGTACTA TTTCAAAAAT AAAAAGACAG CAATGACTTT 1980 

ATATCCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGGC GGCC 2024 




(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 931 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

CGCGCCMATA GCCGGACGGG GATCTGAGCT GGCAGGATGA ATGTGGGGGT GGCACACAGC 60 

GAAGTAAACC CCAACACCCG AGTGATGAAT AGCCGAGGCA TCTGGCTGGC CTACATCATC 120 

50 

rrTGGTAGGAT TGCTGCATAT GGTTCTACTC AGCATCCCCT TCTTCAGCAT TCCTGTTGTC 180 

TGGACCCTGA CCAACGTCAT CCATAACCTG GCTACGTATG TCTTCCTTCA TACGGTGAAA 240 

55 GGGACACCCT TTGAGACTCC TGACCAAGGA AAGGCTCGGC TACTGACACA CTGGGAGCAA 300 

ATGGACTATG GGCTCCAGTT TACCTCTTCC CGCAAGTTCC TCAGCATCTC TCCTATTGTG 360 

CTCTATCTCC TGGCCAGCTT CTATACCAAG TATGATGCTG CGCACTTCCT CATCAACACA 420 

60 
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GCCTCATTGC TAAGTGTACT GCTGCCGAAG TTGCCCCAGT TCCATGGGGT TCGTCTCTTT 480 

GGCATCAACA AATACTGAGG GATGGGTTTT GGGACAGCTC CATGGGCATG GGGAAGGCAC 540 

5 TGAAACAGAG GACTATAAAA CATCCTTCTC TTATTCTCCA TACTCTCTTC TACACCTTTA 600 

AAGCCTGAGA ACTATAC7UVC CTTTCCCAGA CTCCCAAGAA GAGAAGAGAT TGGCAAATGG 660 

GGCTCCTGGG CCCAGTCCTG CTAGTGGCAA GTTrCTTTGA ATCAGGAAGG CAGGTGAGGT 720 

AAGGGCCAAA TCACTCTCCT CCATAGCAGG AAGCCATTTG GGCAGCTCCT 1TGGTGATTA 780 

CATCTTTCCA TATCTTTTAC ACTTACCACC TTCCAGCTCT GmTGCTGT GTATTTTTCT 840 

15 TACAATAATT TTTTTCAGCT ATAGCTGCAG TTTAATCAGG ATGGGTAGAG AGCTGTCCTC 900 

ATAAGGCTGG GGGTGGGAAG ATCGAATACT G 931 






(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 825 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

CGGGGCCGGC GGGGTCTTCA GGGTACCGGG CTGGTTACAG CAGCTCTACC CCTCACGACG 60 

CAAACATGGC AGCGCAGAAG GACCAGCAGA AAGATGCCGA GGCGGAAGGG CTGAGCGGCA 120 

35 

CGACCCTGCT GCCGAAGCTG ATTCCCTCCG GTGCAGGCCG GGAGK3GCTG GAGCGGCGCC 180

GCGCGACCAT CCGGCCCTGG AGCACCTTCG TGGACCAGCA GCGCTTCTCA CGGCCCCGCA 240 

40 ACCTGGGAGA GCTGTGCCAG CGCCTCGTAC GCAACGTGGA GTACTACCAG AGCAACTATG 300 

TGTTCGTGTT CCTGGGCCTC ATCCTGTACT GTGTGGTGAC GTCCCCTATG TTGCTGGTGG 360 

CTCTGGCTGT CTTTTTCGGC GCCTGTTACA TTCTCTATCT GCGCACCTTG GAGTCCAAGC 420 

45 

TTGTGCTCTT TGGCCGAGAG GTGAGCCGAG CGCATCAGTA TGCTCTGGCT QGAGGCATCT 480 

CCTTCCCCTT CTTCTGGCTG GCTQGTGCGG GCTCGGCCGT CTTCTGQGTG CTGGGAGCCA 540 

50 CCCTGGTGGT CATCGGCTCC CACGCTGCCT TCCACCAGAT TGAGGCTGTG GACGGGGAGG 600 

AGCTGCAGAT GGAACCOGTG TGAGGTGTCT TCTGGGACCT GCCGGCCTCC CGGGCCAGCT 660 

GCCCCACCCC TGCCCATGCC TGTCCTGCAC GGCTCTGCTG CTCGGGCCCA CAGCGCCGTC 720 

CCATCACAAG CCCGGGGAGG GATCCCGCCT TTGAAAATAA AGCTGTTATG GGTGTCATTC 780 

AGGAAAAAAA AAAAAAAAGG GGGGCCCCTC TAGGGGTCAA AC?rTA 825 
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(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

CATGTAAAAG GATGAAATGT GACTTCTGGT GTTTTTTTAT TTCTATGGAG GGACTTTCTG 60 

15 GGGACGGTTT CTGGCTCTCA GGCTCTGAGA AGCTGCAGTT TATGAGTGGC TCTGTGTGTG 120 

CTGCCACCTA CTGGAGAAGC CATAAGCTGC AGCTTTAGGA AAAQGGAACC CGGGGCAGAG 180 

TGTGGGGAAG TGGGATGGCA GCATGGCAGG GCTTTGGAAA ATGAGAGGTG AGAGTKTKTC 240 

20 

CAGGAAGGGT GTAAGGAGAG GATGGATCCT GATACATGGA TTCAGGATCA TTAGGGTCCT 300 

GTCTGGGACA CTGGCCTTCC TGCTTACCTG CTCTTTCCTT CCTCCTTGGT CGGAGGAGGG 360 

25 GCTGGCTCAC TGCTCTGGCT TCATTTTCCA GAGCTGOCTG CTGCAGTCAC ACTTAGGTCA 420 

TCTTCTCTCA CTTTTCTCCT TTTGCCGATT AGTGGACGTG ACAGAGATGT GAATGGGGCA 480 

GGGATGTCCT TTGATGGCAT CAAGACTTTA GCTTCTGGTG CGCTGTGTCC CAGCTCTGAT 540 

30 

TTCAGTTGCA GCCGTGATGG AMAGTTNGCA TGGAAGCTGA GACTCTCACT GACAGTGAAA 600 

CCCTCAAATG AACACAATCC CTGCTTTCCT GCCAAGGATC CTTGTAGGGT NCCCCCAGCT 660 

35 TCCCCACTTT TTTTCTGTGT CCTGACAAAG AAACACAGAG TAACTTGATT GCCCTGTGAC 720 

CTSGeCASTT GGATTTCCCC TGCAGGCTTG AGCCCAAGCC AGAGCCTTGA AAAGGTATTC 780

AGGTTGTTGC CCAAAACACT GAAAAAAACT GCCCTGGCCC TGAACCAAAT ACCTTGAACC 840 

40 

CTCGTAAACT CCATACCCTG ACCCCCTTGT TTTGGATATA CCCAGGTAGA ACAACTCTCT 900 

CTGACTGTCT GTTGTGAGGA TACGCTGTAG CCCACTCATT AAGTACATTC TCCTAATAAA 960 

45 TGCTTTGGAC TGATCACCCT GCCAGTCTTT TGTCTTGGGC AATCTATACT TTTNCTCAGA 1020 

GGTTCCCAAG GCCTACTGAA GGGACTTAAC ATACTCTTAA TGGCTTTCCT CTCTCTTGTT 1080 

TTACCTTATG CCCTCACTTC CTGAGTTAAC CTCCCAAATA CAGGATTCAC CTGTACCCAA 1140 

GCCCTTAGCT TCAAGAATAC AGGATCACCT GTACCCAAGC CCTTAGCTCA AGCTCTGCTT 1200 

TGGAAGAACC CAAACTAAGA CAGTGCTCCT GGTGCCCT 1238 



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(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS:

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(A) LENGTH: 1460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:



ATTGCCTTCT GGTCCCTGGT GACACTGGGG TCATCCTTCA TCCCCGGAGA GCATTTCTGG 60 



10 CTGCTCCTCC TGACCCGGGG CCTQGTGGGG GTCGGGGAGG CCAGTTATTC CACCATCGCG 120 



CCCACTCTCA TTGCCGACCT CTTTGTGGCC GACCAGCGCG ACCGGATGCT CAGCATCTTC 180 



TACTTTGCCA TTCCGGTGGG CAGTGGTCTG GGCTACATTG CAGGCTCCAA AGTGAAGGAT 240 

15 

ATGGCTGGAG ACTGGCACTG GGCTCTGAGG GTGACACCGG GTCTAGGAGT GGTGGCCGTT 300 



CTGCTGCTGT TCCTGGTAGT GCGGGAGCCG CCAAGGGGAG CCGTGGAGCG CCACTCAGAT 360 



20 TTGCCACCCC TGAACCCCAC CTCGTQGTGG GCAGATCTGA GGGCTCTGGC AAGAAATCCT 420 



AGTTTCGTCC TGTCTTCCCT GGGCTTCACT GCTGTGGCCT TTGTCACGGG CTCCCTGGCT 480 



CTGTGGGCTC CGGCATTCCT GCTGCGTTCC CGCGTGGTCC TTGGGGAGAC CCCACCCTGC 540 

25 

CTTCCCGGAG ACTCCTGCTC TTCCTCTGAC AGTCTCATCT TTGGACTCAT CACCTGCCTG 600 



ACCGGAGTCC TGGGTGTGGG CCTGGGTGTG GAGATCAGCC GCCGGCTCCG CCACTCCAAC 660 



30 CCCCGGGCTG ATCCCCTGGT CTGTGCCACT GGCCTCCTGG GCTCTGCACC CTTCCTCTTC 720 



CTGTCCCTTG CCTGCGCCCG TGGTAGCATC GTOGCCACTT ATATTTTCAT CTTCATTGGA 780 



GAGACCCTCC TGTCCATGAA CTGGGCCATC GTGGCCGACA TTCTGCTGTA CGTGGTGATC 840 

35 

CCTACCCGAC GCTCCACCGC CGAGGCCTTC CAGATCGTGC TGTCCCACCT GCTGGGTGAT 900 



GCTGGGAGCC CCTACCTCAT TGGCCTGATC TCTGACOGCC TGCGCCGGAA CTGGCCCCCC 960 



40 TCCTTCTTGT CCGAGTTCCG GGCTCTGCAG TTCTCGCTCA TGCTCTGCGC GTTTGTTQGG 1020 



GCACTGGGCG GCGCACTTCC TGGGCACCGC CATCTTCATT GAGGCCGACC GCCGGCGGGC 1080 



ACAGCTGCAC GTGCAGGGCC TGCTGCACGA AGCAGGGTCC ACAGACGACC GGATTGTGGT 1140 

45 

GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC CGTGGCCAGT GTGCTCATCT GGAGAGGCTG 1200 



CCGCTCACCT ACCTGCACAT CTGCCACAGC TGGCCCTGGG CCCACCCCAC GAAGGGCCTG 1260 
50 GGCCTAAACC CCTTGGCCTG GCCCAGCTTC CAGAGGGACC CTGGGCCGTG TGCCAGCTCC 1320 



CAGACACTAC ATGGGTAGCT CAGGGGAGGA GGTGGGGGTC CAGGAGGGOG ATCCCTCTCC 1380 



AACAGGGGCA GCCCCAAGGG CTCGGTGCTA TTTGTAACGG GATTAAAATT TGTAGCCAGA 1440 

55 

AAAAAAAAAA AAAAAAAAAA 1460 



60 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

CAGGTGCAAA GTGGGAAGTG TGAGTCCTCA GTCTTGGGCT ATTCGGCCAC GTGCCTGCCG 60 

GACATGGGAC GCTGGAGGGT CAGCAGCGTG GAGTCCTGGC CTTTTGCGTC CACGGGTGGG 120 

AAATTGGCCA TTGCCAOGGC GGGAACTGGG ACTCAGGCTG CCCCCCGGCC GTTTCTCATC 180 

CGTCCACCGG AYTCGTGGGC GCTCGCACTG GCGCTGATGT AGTTTCCTGA CCTCTGACCC 240 

GTATTGTCTC CAGATTAAAG GTACGACATT TGGAGGCCCC AGCGAGAAAC GTCACCGGGA 300 

GAAACGTCAC CGGGCGAGAG CGGKCCCGCT GTGTGCTCCC CCGGAAGGAC AGCCAGCTTG 360 

TAGGGGGGAG TGCCACCTGA AAAAAAAATT TCCAGGTCCC CAAAGGGTGA CCGTCTTCCG 420 

GAGACAGCGG ATCGACTACC ATGTGGGTGC CCACAAAAAT TYCACCTYTG AGTCCTCAAC 480 

TGCTGACCCC GGGGTCAGTT CCAGAGAGAA GGACTCCCTC CTGCTTGGAA GAGACCTCAC 540 

ACCGTCATCA CGATGCCAAC GGCTCTGAAG GTGGATGGCA TTCCTGCGTG GATTCATCAC 600 

TCCCGCATCA AAAAGGCCAA CRGAGCCCAA CTAGAAACAT GGGTCCCCAG GGCTGGGTCA 660 

GGCCCCTTAA AACTGCACCT AAGTTGGGTG AAGCCATTAG ATTAATTCTT TTTCTTAATT 720 

TTGTAAAACA ATGCATAGCT TCTGTCAACT TATGTATCTT AAGACTCAAT ATAACCCCCT 780 

TGTTATAACT GAGQGAATGA-ATGATTTGAT-TGGGGAAAAA <SVCAAGTGGG -GA^ -840" 

TCCAACCTGG TTTTTACTAA CCCTGTTTTT AGACTYTCCC TTTCCTTTAA TCACTCAGCC 900 

TTGTTTCCAC CTGAATTGAC TCTCCCTTAG CTAAGAGCGC CAGATGGACT CCATCTTGGC 960 

TCTTTCNACT GGCAGCCGCT TCCTYCAAGG ACTTAACTTG TGCAAGCTGA CTCCCAGCAC 1020 

ATCCAAGAAT GCAATTAACT GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA 1080 

ATTCGTCCAA TTGATTACAC CCMAAAGCCC CGCGTCTATC ACCTTGTAAT AATCTTAAAG 1140

CCCCTGCACC TGGAACTATT AACGTTCCTG TAACCATTTA TCCTTTTAAC TTTTTTGCCT 1200 

ACTTTATTTC TGTAAAATTG TTTTAACTAG ACCCCCCCTC TCCTTTCTAA ACCAAAGTAT 1260 

AAAAGCAAAT CTAGCCCCTT CTTCAGGCCG AGAGAATTTC GAGCGTTAGC CGTCTCTTGG 1320 

CCACCAGCTA AATAAACGGA TTCTTCATGT GTAAAAAAAA AAAAAAAAAA CTCGGAGGGG 1380 

GGGCCCGGTA CCCAA 1395 
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(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GGCACGAGCC GGCAAGCCGA GCTAGGGTGA AAACTGGGGG CGCACCAGGA TGTNNGACAG 60 

AAAAGCAGAA GATGAGACTC TGTTCATTCA CTTTTCCTAG GCCCATCCTG TGGTCATCTT 120 

TCCCCCTCCC ATCATACCTC CTCCTTCCTG GAGCCTCTGC CGGCTTGGCT GTAATGGTGG 180 

CACTTAOCTG GATATTTCAG TGGGAGGATG AAAGGCGAGA CTCACCCTAC GCGGTGGGAC 240 

AGATGGGGAG AGGAAAAAGG CAGAGATGGC CAGGAGAGGG GTGCAGGACA AACCAGAGAG 300 

GTTGGGTCAG GGGAAAAGGG TGGGGAGAAA GAGGGGTGCA GGCCCTGCAG GCCGGTTAGC 360 

CAGCAGCTGC GGCCTCCCCG GGCCCTTGGC ATCCAACTTC GCAGACAGGG TACCAGCCTC 420 

CTGGTGTGTA TCATAGGATT TGTTCACATA GTGTTATGCA TGATCTTCGT AAGGTTAAGA 480 

AGCCGTGGTG GTGCACCATG ACATCCAACC CGTATATATA AAGATAAATA TATATATATA 540 

TGTATGTAAA TTATGGCACG AGAAATTATA GCACTGAGGG CCCTGCTGCC CTGCTGGACC 600 

AAGCAAAACT AAGCCTTTTG GTTTGGGTAT TATGTTTCGT TTTGTTATTT GTTTGTTTTT 660 

GTGGCTTGTC TTATGTCGTG ATAGCACAAG TGCCAGTCGG ATTGCTCTGT ATTACAGAAT 720 

AGTGTTTTTA- ATTCATCAAT-GTTGTAGTTA-ATGTCTAGGT-CAGGACGTGC^^Te^ ^BO- 

ATTTTAGGAG GTTGCCCAAT TTTGTTTCTT CAATTTTACT GGTTACTTTO TTGTACAAAT 840 

CAATCTCTTT CTCTCTTTCT CTCCTCCCCA CCTCTCACCC TTGCCCTCTC CATCTCCCTC 900 

TCCCGCCCTC CCCTCCTCCC TCTGGCTCCC CGTCTCATTT CTGTCCACTC CATTCTCTCT 960 

CCCTCTCTCC TGCCTCCTGC TGCCCCCTCC CCAGCCCACT TCCCCGAGTT GTGCTTGCCG 1020 

CTCCTTATCT GTTCTAGTTC CGAAGCAGTT TCACTCGAAG TrCTGCAGTC CTGGTTGCAG 1080 

CTTTCCGCAT CTGCCTTCGT TTCGTGTAGA TTGACGCGTT TCTTTGTAAT TTCAGTGTTr 1140 

CTGACAAGAT TTAAAAAAAA AAAAAGGAAA AAAAAAAAAA AAAAAA 1186 



(2) INFORMATION FOR SEQ ID NO: 90:




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

- 5 AAAACATGCT TTCAGGGCGT CCCCTATGTA TTCGGGGGGC CCACGGACAC TCAGGCTGGA 60 

KATCCGTCCT CACTGCGCTC AAGATGGCCT CAGCAGACAC CAGTTACCCA GCTGAAAGTC 120 

ACAATCCCTC CCAGAAGTCT CCCAACACTA GTGCTGACCA GAGGTGGGGC TCTCAGGCTA 180 

10 

GGAGTTTCAC ACACAATGAC AGGCTGCTGG GGGACATTGC AGGACCCCTT TTCCTYTCCT 240 

CTCCATGCTA GAAGCCAGCC CTAGGMAGCT GCAGTTACTC CCTGTGACTC AGCAGCAGGC 300 

15 TGATTCAACA CAGCTGCCCA CACAAAGCCA GTGGTAATAC ATCTGTTTAC CTTTCCCTAT 360 

CACCCAGACA CAAGCCCCTT TCCCAGGTCA AACCACAGGC CGATGCATCT CCAGTTTGAC 420 

AGTCAAATCA CTACTTCCAT TGCTACTTTA GATCAGCCAA AGTGGTGACT GCTGCAGTGT 480 

20 

GTGGCTATCC CTACAAGGCC CACCCAAGGG ATGCCCAAAG CCCAACCTTC TCCAGGGCTG 540 

CAGCAGNAGC AACCCCACCA GCCTAAGTCC AGCAGAGGAC CTCCCACCCA ATGTCTTGTT 600 

25 CTAATTAGAA GGGGAAGTTA GCCACAGAAA ATCAACTTAT CTATAATTAC AAAATTCTCT 660 

TGACTCACCT TAAAGTTCCT ATTGACATCT ACTGCTTTTA AACCTATTTG AAAACTCTGA 720 

TACTAAAACA AATGACACTC TAAGAAAGTT TGGGAGCCCC ATGCTGAGAA CCATTTCTGT 780 

30 

GCAGTGAGGA TGTTTCCAGA AGCTACITAC CTACATGTGA ATGTGCCATT TTCTTTCCTT 840 

TTGTAGAGAA AATCCCCTTT ACTTTTTGGA ACAGTAATGG CAGCTTCTAG TACAGCCATT 900 

35 ACAGTrrCAT ATGAGAAAAA TTAAGAATAA CTATAAAATT GTTAAAATAT CCAATAATGG 960 

ATAATGATGG CCAGAAGATT TAACATACAA AGTAATTCTC AATGTAAAGC TATTCAGCTC 1020

TTCCAGGTTG AATGCCCTGT AACCCACCCT GACCTTCCAC ATCATCTTCA AAAAGCAGTT 1080 

40 

TCTCTGTTCC CCATGAITCT CCTATAAGGT AACTCTTTAG TCCTCCATTT AGCACATTTT 1140 

AAATCCTCCA AAGAATAAGT ATCATGTGAT TATTTTAGCT TTACAAAAAA AAAGTTGAAT 1200 

45 GGCGTTTTAT TTTCATGGCC TATAAGCAGG TACCTTAGTA GQGCAGATAT AGGAAAAACA 1260 

AATTAGAGCA AAACAAATCC TCTACAAATC CAAGGCAGGA AAAGTGGTGG CAGAGTGACT 1320 

CATTCTCCTG TCCCTCCCAT CAGGTCAAAT CAGGAGGCTG CAGTGAATGC CTGTTCnTG 1380 

50 

AATGTGTAGC AGTTGTTCCT GTAACTCTTT AAAACTTGGC TATAGGCTGT TTAQCACAGT 1440 

ACAGATTAAA GATACAGTTA CGTAAACAGC AAAGTAATTT TATAGTGCTT CATCCATTTA 1500 

55 TCATGCTTTG GTTTGCTAAT TTTTTCACAT ACCTTTTTCT ATCACAGTCT GTTGCTTTTG 1560 

TACACATTTC TCATATTGGG GTTCGACAGG TAAACACAAA CTGCtATTTC AGTAGAAAAA 1620 

GTTATTGTTA TGGAATATTA AACCCAATAA ATTGTATAT^ GGGTAAAAAA AAAAAAAAAA 1680 

60 
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AAAAAAAAAA AAAAAAAAAA AAAAAAATTC CTGCGGGCCG CA^K3CTTTTT CCCTTTGGGT 1740 
GAGGGGTTAT TTTNGGCTTG GGCACTGGGC CCTTCGTTTT TACAACGTCG TGANC3GGGGG 1800 
3 AACCCGGGGG GGGTTTCCCC C 1821 



(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

20 TGCCCTTTTT CCCACCGATT CGGGGCNTGG TGAAGGTGGG AGATGTGAAC TCCAATTAAG 60 

GGACTGGAGA GAGGTGAAGA ATTTTGCAGG TGGGAGATTT GGATTTGAAT GTGGACTTGT 120 

AAATGACTTG ACCTTGCCAT CTGTGTTCAA GGTCACGGTT TGCTGTGGGG TTCCTGGGAG 180 

25 

AGCTTACTCA CCCCGGAGTC TTTTCTTTCT CTTGCTCCAA GAAGAGCCCT GTTGGTGCTr 240 

TACCACCGCT TGGAGTCTCC CGAGGACACA AACAGGCAGA GAGGGACGTG TAGGGAGAGT 300 

30 TCTTTCCTGT TTTCTGTGCT TTCCTTTTTA CAGGACTCCC GGAAGGCCAC TCATGGCCAT 360 

GCCAGGAGCT TTCTCAGAAA CAGTCATAAA CGATCTCTTG AGTCTCTTTC TTGTCCTCCC 420 

AGCTGAGCTT TCTTATTCCA CCCTTTCTGG TGTCTATAGG AATGCATGAG AAGACCCTGG 480 

35 

GACGTTTTTC TGCTCTCTTC TGGCCCTCCA TGGAGCCATG GGCCTCGGCC TCGGCGGCTC 540 

CTCACCCTCA CAATTTATTT CCTCCTCCCG TGCCAGCCCT TCTTTTGTGT CTGAAACCGG 600 

40 TTTTAAAATG TGACTCTCCC AGAGAAGAAG CCGCTGGCTG TATGAAACTT GACGGCGCTT 660 

TTGTAAGGTG CCACCCCCAA ACTTTAAGGT AGCTAAACCA ATTTTTAAAA GATTCAATGG 720 

CTTGTTCATC CTCCAGATGT AGCTATTGAT GTACACTTCG CAACGGAGTG TCTGAAATTG 780 

TGGTGGTCCT GATTTATAGG ATTTCATAAT TAAAATGTCT GCTGAATAAA AAAAAAAAAA 840 

AAAAACTCGA GGGGGGCCCG GT 862 






(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



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313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

CTGAGGCGAG TGAAGTQGAC TCTGAGGGCT ACCGCTACCG CCACTGCTGC GGCAGGGGCG 60 

-5 TGGAGGGCAG AGGGCCGCGG AGGCCGCAGT TGCAAACATG GCTCAGAGCA GAGACGGCGG 120 

AAACCCGTTC GCCGAGCCCA GCGAGCTTGA CAACCCCTTT CAGGACCCAG CTGTGATCCA 180 

GCACCGACCC AGCCGGCAGT ATGCCACGCT TGACGTCTAC AACCCTTTTO AGACCCGGGA 240 

10 

GCCACCACCA GCCTATGAGC CTCCAGCCCC TGCCCCATTG CCTCCACCCT CAGCTCCCTC 300 

CTTGCAGCCC TCGAGAAAGC TCAGCCCCAC AGAACCTAAG AACTATGGCT CATACAGCAC 360 

15 TCAGGCCTCA GCTGCAGCAG CCACAGCTGA GCTGCTGAAG AAACAGGAGG AGCTCAACCG 420 

GAAGGCAGAG GAGTTGGACC GAAGGAGCGA GAGCTGCAGC ATGCTGCCCT GGGRGGCACA 480 

GCTACTCGAC AGAACAATTG GCCCCCTCTA CCTTCTTTTT GTCCAGTTCA GCCCTGCTTT 540 

20 

TTCCAGGACA TCTCCATGGA GATCCCCCAA GAATTTCAGA AGACTGTATC CACCATGTAC 600 

TACCTCTGGA TGTGCAGCAC GSTGGNTCTT CTCCTGAAYT TCMTCGSCTG CCTGGCCAGT 660 

25 TCTGTGTQGA AT^CAACAAT GGCGAGGCTT TGGGTT 696 



(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1886 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:



40 CAQGCCACTG ACGCTTCTTT GCGAGGGATG CAGGAGGTCC TACAGAGAAA GGCGCTTCTT 60 

GCATKTCAGA GGGCCCACAG CCTGTCACCC ACAGATCACC AAGCAGCTTT CTACCTGGCT 120 

CTGCAGCTTG CCATCTCCAG ACAGATCCCA GAGGCTCTGG GGTATGTCCG CCAAGCTCTT 180 

45 

CAGCTTCAAG GTGACGATGC CAACTCCCTG CACCTCCTTG CCCTCCTGCT GTCAGCACAG 240 

AAGCATTACC ATGACGCTCT GAACATCATC GACATGGCCC TGAGTGAATA CCCAGAAAAT 300 

50 TTCATACTAC TGTTTTCCAA AGTGAAGTTG CAGTCACTCT GCCGAGGCCC GGACGARGCA 360 

CTGCTGACTT GTAAGCACAT GCTGCAGATA TGGAAATCCT GCTACAACCT CACCAACCCC 420 

AGTGATTCTG GACGTGGGAG CAGCCTCTTA GATAGAACCA TTGCTGACAG ACGACAGCTT 480 

55 

AATACAATTA CTTTGCCAGA CTTCAGCGAT CCCGAGACAG GCTCCGTCCA TGCCACATCG 540 

GTAGCAGCCT CAAGAGTGGA GCAGGCACTG TCGGAAGTGG CTTCGTCTCT GCAGAGCATG 600 

60 CCCCTAAGCA GGGCCCGCTG CACCCCTGGA TGACGCTGGC ACAGATCTGG CTCCATGCAG 660 
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CTGAAGTCTA TATCGGCATC GGGAAGCCTG CAGAAGCCAC AGCCTGTACC CAAGAAGCTG 720 

CCAACCTCTT CCCAATGTCC CACAATGTCC TCTACATGCG CGGCCAGATT GCTGAGCTCC 780 

.5 

GGGGAAGCAT GGACGAGGCG CGGCGGTGGT ATGAAGAGGC CTTAGCCANT CAGCCCCACC 840 

CACGTGAAGA GCATGCAGCG ACTTGGCCCT GATCCTTCAC CAGYTAGGCC GYTACAGTYT 900 

10 GGCGGAGAAG ATCCTCCGGG ACGCGGTGCA GGTGAACTCG ACAGCCCACG AGGTCTOGAA 960 

CGGGCTGGGC GAGGTCCTCC AAGCTCAGGG CAACXaVTOCG GCGGCTACGG AGTGCTTCCT 1020 

GACAGCCTTG GAGCTGGAGG CCAGCAGCCC CGCCGTGCCC TTCACCATCA TCCCCCX3CGT 1080 

15 

GCTCTGAGCA GGCGCCTGCC AGCCTCACCT GCCGCTCAGC CTNCAGAGGC CCTGCCGGGC 1140 

ACCAGGGCTT GTGCCATCGC CCCAAGGGGA TGAATCTGCC GCACTGAGGC CAGGGACGAG 1200 

20 TGTTCAGTGG GCXrACAGTGA ACXIAACCAAA CCAACCCCGA ATCATCGCTC TCGCCATGTG 1260 

CGTTTCTCTT GTTTTTTTTG CCAGCCCAAT GGTAGTTTCT GAACCTATTG ACATTGTTCA 1320 

AAATGGATCA TGTGCCATAT TTTGTTAGTT GACATCTGAG TTTTCAGTAA AATGATTATG 1380 

25 

GAATTAATCA GCAAATGTAG AAGAATATAT TCAAAGTTAA AATTCAGTGG CAGCACAGAT 1440 

TATTTTTATC AGAGCTGTAA AGAAAACAAC TGTCCTTTTC TCCCCACCAC CCCTCCTGCC 1500 

30 CCACTTTGGC CCAGAAACCA AATGTGAACT TCCTGTCTCC CACCTCAGCA CTTAGTCCATG 1560 

CCAGGACACC AGCTGACAAT TTCTTGGTTT TACTGTCAAT AATTGTACCA TGTGATCAAT 1620 

TACTGTCCTC ACTTAGAACA AAGCCTGAGT CCGAGAATAT TTATATTTTA CX:AATATATG 1680 

CCTGTTACAA GAGAAGGAAA TATGAGTTAT TTAAGTTTAA CTTTTTTATG TGAATTCAGA 1740

GTTTATTTAT CGAGGGAAAT ATGTACAAAG AAGCTTCAAA TGGAATATTT ACCGACATTC 1800 

40 CTTATACATG ACAGACACTT GGCTACATGG GAAGATGATG TTAATAATAA AATGATTTTT 1860 

AAATGGAAAA AAAAAAAAAA AAAAAN 1886 



(2) INFORMATION FOR SEQ ID NO: 94: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1774 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

CTCAGCTACC GTATACAGTA GGACATAACC CCATTTCACA TGCACTACAC TGAGACTTGC 60 
CTCCTCTCCC CCCACATTGA AGATGTTCTT TTTTCATAAC TATATACTAT TCCATTGCAT 120 

GAATATTCTG TAATTTATTT AATCCCCTAT GGATTGATAA TTAGGTTCAT TATAGATAGA 180
AGTGTAATTA ACA1TCCTGT ACATGTATTT TGCTACTTGT GTGGGTATTT CTGTAGGATG 240
AATAACTAGA AATTTATTGG ATCAGGITTC ACATTTGCAG TTTTGAAAAC TACTACCAAA 300
AAGATTTCAC CAATTTACAA CTCCATCATT AGTAAGAATG CCTGTTTGCC TATAGTCTGC 360
CAACCCTGAA TCCTTAAAAA TTTTTGCCAA TCTGGTAGGC AAAATTTCTT TCTTTTCrTT 420
GAATATTAAT GAGGAGGAAC ATCTTTTCAT GTTTCTTGGC CATTTGCATT TCCTATTATG 480
AATTGCTTTT GCCCATTTTC CTTTTTTTAA TTATGAAAGT CTAATGACTA CCTTCTCATT 540
GTATAAAAAA CACAGTTCTT TGAATAGAGA GACCCTTTTC TCCAATGCTA CCAATCACAT 600
TCCACTTACC ACAGTTTAAC ATACATCCTC TAGTCACXTTT TCCGTACGAA TATACATACA 660
CATAAAAACA CTTTTTACAT AAATAGGATC TCATATTCTG TAGCTTTTTA AAATTTTGGT 720
CTCAAAAAAA GATAACAGGT CTTTAAATTT CTTTAATGGT TGAATATGAT TAAATACTAT 780
GAAAATGCCA TTATTTATTC CCTTAATTTT TTTCCTCTCG CTATTACATT GCCAAAGTAA 840
ACATCCTATT CAGATGTCTT TGTGCATGTG TGTGAATATT TCTTTAGTCT GGAGTCCAGT 900
AAGGTGGATT TTTGGATCT^ AGGGTTTGTT CTCTGTCCAC CTTCAGTCTT CCCAAAGGCX: 960
TTCATAACTG TATTTTCACC AAGTGTATGG AGAATGTTCA TTTCCCCATA TAACCATACC 1020
TACACTTGAT AGTTTTTATC TGTTGGGCGA AAAAGAACCT TTTCTTATTT TGCATTTCCC 1080
TGATTATAAA AAAAAATGGT GAGATTGGGG TTATTTTCAT GTTTATTGGC CATTTATAGT 1140
TTACTGTGGA TTGTrTGTAT CCCTTACCTG CTTTCTATTG GGTTATGTGT GGATATATTG 1200
TTTTTATTTG TTCAGCATCT CCTTCCCCAT CTTCTCGTAA^CACAAC^ ATTTATTOGT 1260
GGGGAACCTA TTCGXrTGTGG CTTAGGTGAG CATGTGACCA GGCCTGGCCT CCTGAGTCCC 1320
ACAGCTTCCT AGCCACAGTG ATAAAAGAAT GGGTATATAA CTTAAGCCAG GCTAAGGAAA 1380
GCCCTTAACA GAACTTCTGC TGGAACTACT GGAAAGAAGG CTTTATGGAG ATCCCAGGAA 1440
CCAAGGACCA TOTAAGCXTTG AATTTGTGCC ATGTGGAGAG AGTCTGTCTG AGGAGAAACT 1500
CX3GATGCTAG CAGAAATGGA AAGAGAACTA AGTTCTGATG TCATTTTTCT GGAGGCCCTA 1560
GATCCAGCTG TGCCTAAAGC CTGCCCTACT CCGGACTTTA TAGTTTTGTG AGCCAATAAA 1620
GTCCCTTTCT TGTTTAAGAT T^TTGAATTG AGTTTCTGTT CTGATTAATA TAGGTTATTT 1680
GTATTTTCTT ATTGATTTGT AGAAAACCTT TGTAATTTTA AATTCTAGAC TTTATGCACT 1740
ATATAAGTTA ATAAAATTAG CATGGCCTTC CATG 1774

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2503 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

10 GGCACGAGCG AAGGCAAGGG GGCACCAGCT CAGGACTGCA TCTGCCTGCC ATTTCCCTTC 60 

CACTCCTCCT TTCTGGAGTC TGACATTAGA AAGCCAGCGA GAAGGAAGAT TCAAACAACC 120 

AACCCTGATT TCCTGCTTCT CCTTTTCATG AGTGTTCCTG TGGTCTCTGC ACCTCCTTTC 180 

TGTCCCCCGG CAGAGGGCAG TAGAGATGGC CGGCCCAAGG CCTCRGTGGC GCGACCAGCT 240 

GCTGTTCATG AGCATCATAG TCCTCGTGAT TGTGGTCATC TGCCTGATGT TATACGCTCT 300 

20 TCTCTGGGAG GCTGGCAACC TCACTGACCT GCCCAACCTG AGAATCGGCT TCTATAACTT 360 

CTGCCTGTGG AATGAGGACA CCAGCACCCT ACAGTGTCAC CAGTTCCCTG AGCTGGAAGC 420 

CCTGGGGGTG CCTCGGGTTG GCCTGGGCCT GGCCAGGCTT GGCGTGTACG GGTCCCTGGT 480 

CCTCACCCTC TTTGCCCCCC AGCCTCTCCT CCTAGCCCAG TGCAACAKTG ATGAGAGAGC 540 

GTCGCGSCTG GCAGTGGGCT TCCTGGCTGT KTCCTCTGTG CTGCTGGCAG GCGGCCTGGG 600 

30 CCTCTTCCTC TCCTATGTGT GGAATGGGTC ARGCTCTCCC TCCCGGQGCC TGGGTTTCTA 660 

GCTCTGGGCA GCGCCCAGSC CTTACTCATC CTCTTGCTTA TAGCCATGGC TGTGTTCCCT 720 

CTGAGGGCTG AGAGGGCTGA GAGCAAGCTT GAGAGCTGCT AAAGGCTTAC GTGATTGCAA 780 

GGGTTCAGTT CCAACCATCG .TCAGAGGTGG_ CACATCTGCT-O^^^ - . - - 840- 

TAACGCTGAT CTCCAGCTCC AGCGATGGAA CCCACTACAG AGGAGGTGGG GCCCCTGTGT 900 

40 CAAAGAGGCC GAGGGGCAGC AAGGGCAGMC AGGGCACCTG TGACTTCTTA GTACAAGATT 960 

GTCTGTCCTT CAGGACTTCC AAGGCTCCCA AAGACTCCCT AAACCATGCA GCTCATTGTC 1020 

ACACCAATTC CTGCTTTAAT TAATGGATCT GAGCAAATCT TCCTCTAGCT TCAGGAGGGT 1080 

GGGGAGGGAG TGATTGCTGT CATGGGGCCA GACTTCCAGG CTGATTTGCC AAATGCCAAA 1140 

ATGAAACCTA GCAAAGAACT TACGGCAACA AACGAGGACA TTAAAAGAGC GAGCACCTCA 1200 

50 GTGTCTCTGG GGACATGGTT AAGGAGCTTC CACTCAGCCC ACCATAGTGA GTGGGCCGCC 1260 

ATAAGCCATC ACTGGAACTC CAACCCCAGA GGTCCAGGAG TGATCTCTGA GTGACTCAAC 1320 

AAAGACAGGA CACATGGGGT ACAAAGACAA GGCTTGACTG CTTCAAAGCT TCCCTGGACC 1380 

TGAAGCCAGA CAGGGCAGAG GCGTCCGCTG ACAAATCACT CCCATGATGA GACCCTGGAG 1440 

GACTCCAAAT CCTCGCTGTG AACAGGACTG GACGGTTGCG CACAAACAAA CGCTGCCACC 1500 

60 CTCCACTTCC CAACCCAGAA CTTGGAAAGA CATTAQCACA ACTTACGCAT TGGGGAATTG 1560 
TGTGTATTTT CTAGCACTTG TGTATTGGAA AACCTGTATG GCAGTGATTT ATTCATATAT 1620

TCCTGTCCAA AGCCACACTG AAAACAGAGG CAGAGACATG TACTCTGGTG TGATCTCTTG 1680 

TCCTCAGTGT CTCTTCTGGG CTCCTGTCCC TCTTGCTTTA TAGCTAGCTG CCXZGGGGACC 1740 

AAGGTACAGG TGAAAGCAAG GTAGCAGCTT GCGGGAGGAG GCCTGTCTQG CTTACCAGTC 1800 

10 TATACACTGT GGCCTCAACC TCCCAGACAG GGCAGAGAAC OXyrGGGCAGC TCGTTTGCTT 1860 

TCTAGGCTGG CTGGAGAGGT GGGAGCTCAT TGATAGACTC ATGATGGAAA CTATTTTTGA 1920 

AACAGGCTTC CTCCTTCAGG AGAGATCATG CGGACTAAAC TGTAGCAATT CXIAGTGCACC 1980 

TGGCAGTGAT CCTTTTCTTT GCAAAGTACT GTCTCTTTGG TTCCAGTAAG TTGGACCACC 2040 

ACATGACATY ATTTTCCCTG GAACCTGGTC ACTGACTAAC ACAGACAATT QGGACTCCAG 2100 

20 AGCCTCAAGA GCCAGGAGAG GGCACAGTAC ATACAGAGGG AGTCAAATGG GATCTCATTT 2160 

TGAGTCCTGC CTTCCGCACA CTCAGAACGG CANCCCCAAG GCCCGGAGTG TCCAGQGCTT 2220 

CTGGCCTGAG GTGAATCTGC CAGGCCCAAG AAGGCACAAA GGTAGGAGCA CAGAGAGCCC 2280 

CATTCCCACA GGCGGKCGGC CCAGCAGCAC CAGTGGAAGC TCAGCa?GTCC TCCAGCTGCT 2340 

CTCGGCAGAC AGTTCAGTGC ACAGTTTATG CCCTAGCTGA AAAAGATCTC CCX3GACX3TAT 2400 

30 TTCAGCACAT CCTCTTCCTC CTCCTCCTCA GGGCTCCTGC TACAGGCAGA GCTGGAACCC 2460 

CCCGGCCTCT GGGAAGGGCT GAGGCCTGGA GYCAGTGCCT GTC 2503 



(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2801 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

CTGGAAAGCC GAGGGTAGCC GAGCGGGGCG GGCGCTCTGG AGCGGCGGGT GCTCGGGCTG 60 

CCGTCCGCTC CGCCAGAAGC ACCGAGCAGC CGAGCCGGGG CCCGCCGCCC TCCTCCTCCA 120 

TGAGGCCCGA GTGAGGCGCG GCGGCTATAG CCGACCCGCG GCGCCTTCCC CCCGCGTCCT 180 

ATCGCGAGCG CACGACMAGC GGCCCCTGGA GGAGGAGGCG GAQGAGGAGG AGCATGTCGG 240 

55 ACGGTTTCGA TCGGGCCCCA GGTGCTGGTC GGGGCCGGAR CCGGGGCCTG GGCCGCGGAG 300 

GQGGCGGGCC TRAGGGCGGC GGTTTYCCGA AMGGARCGGR GCCTGCTGAG CGGRCGCGGC 360 

ACCAGCCGCC GCAACCCAAA GCCCCGGGCT TYCTGCARCC AMCGCCGCTG CGCCARCCCA 420 

GGACGACXXrC GCCGCCAGGG GCCCAGTGCG AGGTCCCCGC CAGCCCCCAG CGGCCTTCCr 480

GGCCCGGGGC GCTCCCAGAG CAAACGAGGC CCCTGAGAGC TCCACCTAGT TCACAGGATA 540 

-5 AAATCCCACA GCAGAACTCG GAGTCAGCAA TGGCTAAGCC CCAGGTGGrr GTAGCTCCTG 600 

TATTAATGTC TAAGCTGTCT GTGAATGCCC CTGAATTTTA CXTCTTCAGGT TATTCTTCCA 660 

GTTACACAGA ATCCTATGAG GATGGTTGTG AGGATTATCC TACTCTATCA GAATATGTTC 720 

AGGATTTTTT GAATCATCTT ACAGAGCAGC CTGGCAGTTT TGAAACTGAA ATTGAACAGT 780 

ITGCAGAGAC CCTGAATGGT TGTGTTACAA CAGATGATGC TTTGCAAGAA CTTGTGGAAC 840 

15 TCATCTATCA ACAGGCCACA TCTATCCCAA ATTTCTCTTA TATQGGAGCT CGCCTGTGTA 900 

ATTACCTGTC CCATCATCTG ACAATTAGCC CACAGAGTGG CAACTTCCGC CAATTGCTAC 960 

TTCAAAGATG TCGGACTGAA TATGAAGTTA AAGATCAAGC TGCAAAAGGG GATGAAGTTA 1020 

CTCGAAAACG ATTTCATGCA TTTGTACTCT TTCTGGGAGA ACTTTATCTT AACCTGGAGA 1080 

TCAAGGGAAC AAATGGACAG GTTACAAGAG CAGATATTCT TCAGGTTGGT CTTCGAGAAT 1140 

25 TGCTGAATGC CCTGTTTTCT AATCCTATGG ATGACAATTT AATTTGTGCA GTAAAATTGT 1200 

TAAAGTTGAC AGGATCAGTT TTGGAAGATG CTTGGAAGGA AAAAGGAAAG ATGGATATGG 1260 

AAGAAATTAT TCAGAGAATT GAAAACGTTG TCCTAGATGC AAACTGCAGT AGAGATGTAA 1320 

AACAGATGCT CTTGAAGCTT GTAGAACTCC GGTCAAGTAA CTGGGGCAGA GTCCATGCAA 1380 

CTTCAACATA TAGAGA7W3CA ACACCAGAAA ATGATCCTAA CTACTTTATG AATGAACCAA 1440 

35 CATTTTATAC ATCTGATE3GT GTTCCTTTCA CTGCAGCTGA TCCAGATTAC CAAGAGAAAT 1500 

ACCAAGAATT ACTTGAAAGA GAGGACTTTT TTCCAGATTA TGAAGAAAAT GGAACAGATT 1560 

TATCCGGGGC TGGTGATCCA TACTTGGATG ATATTGATGA TGAGATGGAC CCAGAGATAG 1620 

AAGAAGCTTA TGAAAAGTTT TGTTTGGAAT CAGAGCGTAA GCGAAAACAG TAAAGTTAAA 1680 

TTTCAGCATA TCAGmTAT AAAGCAGTTT AGGTATGGTG ATTTAGCAGA ACACAAGAGA 1740 

45 GCAAGAAAAT GTGTCACATC TATACCAAAT TRAGGATGTT GAGTTATGTT ACTAATGTAT 1800 

GCAACTTTAA TTTTGTTTAA CACTATCTGC CAAAATAAAC TTTATTCCCT ATAACTTAAA 1860 

ATGTGTATAT ATATATAATA GTTTATTATG TACAGTTAAT TCTACTGTTT TGGCTGCAAT 1920 

AAAATCGATT TTGAAATAAA TGAAATGTTG AAAATTTTGC TAGTTGGTTA GATGCTTATC 1980 

CTTTAAATTC TACTTTTCTT GAGGGGAAAA AGTCTTCGTC TGGAAATACA TATTACTCCA 2040 

55 AAAATGTAGC ATCCTTTTTT AGGTAGGAGT ATTATAGCTT YCATTTrAGT TKGACATTTA 2100 

GTGTCCXZAAT GAATTGAATT TCAAATATGA ATCATAATCT TGAAAATCTT TAGCACTAAA 2160 

GTCTTGGGAA TATATCAACA ACTGATTTAC ATATGCAGAT GCTATTTCaaA TACCAAGGGC 2220 

TTTTTAAATG TCATCGGGGG GAAAAACCCA ACTTGGTGGA ACTCCCAGCT AAACAACCAA 2280

GACTTCACTG GAAGATTTAT TCCAATTCTA GGAATTGTTC TTTTTTATTT TTATTTTTTC 2340

AACTGRCTAA CTTCATTACC TTAAAGCCTA GAACATTATT CTGCTTTATT TATATGGCTT 2400

TCTCACTTTT ATTTTGTAGC AKGGGTTGCA TCGACTTTTT TACTAGAGAA TTTTACTAGA 2460

TATTTCTCAT TCAAGTTTTC ATCTGCTTTA TAATTGATAC ACCTTGAGGG TCACTTTTCT 2520

AATACTTTTA CTATAATGTG GTACCACCTC AGCCCTAATA AATAATATTT OTACCTAATG 2580

TCAAATCTTT TTCCAGCTAA CTAAAAACTG TGTACAAAAG GATTGCTTGT AAATATGCAT 2640

GTAAATAGTT CTGTTAATAA CCCACTGTTT TACATTTGGT ACATCTGTGT CTGCTAATAC 2700

AGTTAGCTTT CTCACTTTTC TGCTTGTTTG TTCAGTCTGA ATTAAAATTA GACTTTGAAA 2760

ATAAAGCTTA AAAAAAAAAA AAAAAAAAAA AAAAACTCGA G 2801



(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

ATGGAGCCAA AGACAATCAC TGATGCTTTG GCTTCTAGTA TAATTAAGAG TGTGCTGCCT 60 

AATTTTCTTC CATACAATGT CATGCTCTAC AGTGATGCTC CAGTGAGTGA ACTGTCCCTC 120 

GAGCTGCTTC TGCTTCAGGT TGTCTTGCCA GCATTACTCG AACAGGGACA CACGAGGCAG 180 

40 TGGCTGAAGG GGCTGGTGCG AGCGTGGACT GTGACCGCCG GATACTTGCT GGATCTTCAT 240 

TCTTATTTAT TGGGAGACCA GGAAGAAAAT GAAAACAGTG CAAATCAACA AGTTAACAAT 300 

AATCAGCATG CTCGAAATAA CAACGCTATT CCTGTGGTGG GAGAAGGCCT TCATGCAGCC 360 

CACCAAGCCA TACTCCAGCA GGGAGGGCCT GTTGGYTTTC AGCYTTACCG CCGACCTTTA 420 

AATTTTCCAC TCAGGATATT TCTGTTGATT GTCTTCATGT GTATAACATT ACTGATTGCC 480 

50 AGCCTCATCT GCCTTACTTT ACCAGTATTT GCTGGCCGTT GGTTAATGTC GTTITGGACG 540 

GGGACTGCCA AAATCCATGA GCTCTACACA GCTGCTTGTG GTCTCTATGT TTGCTGGCTA 600 

ACCATAAGGG CTGTGACGGT GATQGTGGCA TGGATGCCTC AGGGACGCAG AGTGATCTTC 660 

CAGAAGGTTA AAGAGTGGTC TCTCATGATC ATGAAGACTT TGATAGTTGC GGTGCTGTTG 720 

GCTGGAGTTG TCCCTCTCCT TCTGGGGCTC CTGTTTGAGC TGGTCATTGT GGCTCCCCTG 780 

60 AQGGTTCCCT TGGATCAGAC TCCTCTTTTT TATCCATGGC AGGACTGGGC ACTTGGAGTC 840 
CTGCATGCCA AAATCATTGC AGCTATAACA TTGATGGGTC CTCAGTGGTG GTTGAAAACT 900

GTAATTGAAC AGGTTTACGC AAATGGCATC CGGAACATTG ACCTTCACTA TATTGTTCGT 960 

AAACTGGCAG CTCCCGTGAT CTCTGTGCTG TTGCTTTCCC TGTGTGTACC TTATGTCATA 1020 

GCTTCTGGTG TTGTTCCTTT ACTAGGTGTT ACTGCGGAAA TGCAAAACTT AGTCCATCGG 1080 

10 CGGATTTATC CATTTTTACT GATGGTCGTG GTATTGATGG CAATTTTGrC CTTCCAAGTC 1140 

CGCCAGTTTA AGCGCCTTTA TGAACATATT AAAAATGACA AGTACCTTGT GGGTCAACX». 1200 

CTCGTGAACT ACGAACGGAA ATCTGGCAAA CAAGGCTCAT CTCCACXACC TCCACAGTCA 1260 

TCCCAAGAAT AAAGTAGTTG TCTCAACAAC TTGACCTTCC CCTTTACATG TCCTTTTTTG 1320 

TGGACTTCTC TCTTTGGAGA TTTTTCCCAG TGATCTCTCA GCGTTGTTTT TAAGTTAAAT 1380 

20 GTATTTGACT TGTGTTCTCA GCATTCAGAG AGCAGCGGTG TAAGATTCTG CTGTTCTCCC 1440 

TGGATCTTCT GACATTACTG CTGTCTGAGA TTTGTATATG TGTAAATACA AGTTCCTTGA 1500 

TACCCTAAAA CCTTGGATTA AACAGAATGT GCATTGTACA TCTTTAAACA AAATGTATAT 1560 

TAATTTATTA AATCTAGTTG TCACTTTAAA AAAAAAAA7A AAAAAACTCG AGGGGGGCCC 1620 

GGTACCCAAA T 1631 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 504 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CCGAGCTGGG CGAGAAGTAG GGGAGGGCAC GAGCCGCCGC GGTGGCGGTT GCTATCGCTT 60 
45 CGCAGAACCT ACTCAGGCAG CCAGCTGAGA AGAGTTGAGG GAAAC3TGCTG CTGCTGGGTC 120 
TGCAGACGCG ATGGATAACG TGCAGCCGAA AATAAAACAT CGCCCCTTCT GCTTCAGTGT 180 
GAAAGGCCAC GTGAAGATGC TGCGGCTGGA TATTATCAAC TCACTGGTAA CAACAGTATT 240 

CATGCTCATC GTATCTGTGT TGGCACTGAT ACCAGAAACC ACAACATTGA CAGTTGGTGG 300 
AGGGGTGTTT GCACTTGTGA CAGCAGTATG CTGTCTTGCC GACGGGGCCC TTATTTACCG 360 
55 GAAGCTTCTG TTCAATCCCA GCGGTCCTTA CCAGAAAAAG CCTGTGCATG AAAAAAAAGA 420 
AGTTTTGTAA TTTTATATTA CTTTTTAGTT TGATACTAAG TATTAAACAT ATTTCTGTAT 480
TCTTCCAAAA AAAAAAAAAA AAAA 504 



(2) INFORMATION FOR SEQ ID NO: 99:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1416 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

GGCACGAGGG AGGGAGCCCT CTCCGTTGGG TGACTCTTGT GTGCCCTTTA GACAGGCTGG 60 

CCTGCCGGTT CCACAGGGTA CAGTTAGGAC TTGAGTCTTT CTTTTTCTGT TTTGAGTTGG 120 

TGAGTGAGTG ATAGGGTAAC ATGGGCCTTC AGGATGACCC CTTGGAACTG TGCCGAGTTC 180 

20 CTTAAATCTC AGCTGGGATC CTGGACCTGG GAGGCCCCTG TGAGGGCCAG CTCTGGAAAA 240 

ACCTGGGAGT TGATGCCGGA GCTGTGGAAG AACTCTGCTC GAGGGCAGGG TGCCCTGGAA 300 

CACTGGTAGT TCTGGGGCTG GGAGGGAGAG GGGCTCCGGC TTTCTCTGAA ATGAACACTG 360 

CTCTTCAGCA GTTCAAGTAC TTGTTCTCAA AACATTTTCT AATTGATTGG TAGGTTTTCA 420 

TAAGCATTGT TTCTTTAAGG CATGGAAAGG GAAGAATGCT CAAGCAAGTC ATGTTTGTTT 480 

30 TCAGTGGGAT GGGCCCGCGT TCTCACTGCT GGGGGCTTCC CCTTCATGTG GCACCTTTGT 540 

GCAGGGGCCA CCAGGCAGAC TCTTCCCACC TTCTCCCACT GAAGCACCAA GGGGCTTGGA 600 

ACCGTAATTT GGCTAATCAG AGGCATTTTT TTTGTCCTAG TATCTTTCAC ACTTGTCCAA 660 

CCGTCTTATT TTTTTAAAAG TTCTGTTGCT TGTATTAACA CGAAACTAGA GAGAAATAGT 720 

TTCTGAAGCC AGTTTATTGT GAAGATCCCC AAGGGGACGT TCGGTAGAGA AAAATAGTAA 780 

40 GCTGGTTTAG AAACTGACGA GGGCAAACAG CCAGGACGCA TTGGAGAGGA ATTTGCCAAA 840 

GATCTACCCT GAGATAACGC CTGTCCAGTG TCTTCACCAC GTGAATAACC AGCGCTCCAA 900 

AGTGTTTTTC TGCTTTGAAA AAAAAAATTC CACAAGCTTT TAAAGGTGCA TTTAAGAATC 960 

CATGTGACTT TAGAATGGAA CTGCCGGCCC TGGCAACTGT CACGTGTGCT AGAAGGTTCG 1020 

ATGCCTCTGG AATGCATGTG ATACTCATCT CCATTTTGTT TCCTTGATTG CATTTTTGTT 1080 

50 CTTTTAGCAG ATCTGTCCCT GTGGGTGGTG TCTAAGAAGT CGGACACCTT GGTTTTTGTG 1140 

TTAGATTGAG CTGGGCAGCT GCAATCAGCT TCTTTATATG CAAATTAGGC ACGACCCATC 1200 

TGTGGTTCCT GGTTGGTGGC TAATGAAGTG AGGQGAGGGA GGGATGTCAC CCCAAAAGTA 1260 

GGCCCTCCCA TTGGCTTTGG CCAGGCCAGA CACTTCACAT CGTTTACATG GTTCTGTGTA 1320 

ATTTTAAAGT TTATGTGTAT AAAGCGAAGC TGTTTCTGTG AAACTGTATA TTTTGTAAAT 1380 

60 AAATATATTG CTACTTGAAA AAAAAAAAAA AAAAAA 1416 

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2847 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

GGCTAGGACA ATTTTGGTGC TTTACCTATC TCTGCAAAGA CTGGAGAATT TGGCATACCA 60 

TTAATTACAA CCACCAATCA TATCCAACAA AAGTACCCTA AAAGAAGGAC CAGTGGCCAC 120 

TCTCGAAAAA ATTTAAGTAT CAGAAGATTA AAAAGATTTT AGGATTTGGA AGCTTGTATT 180 

GTCTTTCCCC AATAATCATT GTTTGATCTC CAAATAGTAG CCTTATATTA GCAATRGACA 240 

GATCATTGGT TCTCCATATC TGATCATATG TTACTACTTT GGAATCAGTA TTTGGGCAAA 300 

TTCAAGCATT TATGCAGTGG ATATAAATGG AAATATAAAA ATATTTGCCA ACCTGTCTCA 360 

GTAACTTATC ATATCTCTGT GNATCCTCAA GGAAAGCACT TTTGCTTTTA CTTAGAAAGC 420 

GTTTCAGATT TGCTTTATAG ACTCCTGCTG TCTTCAGTAC CTGATAAAAC TTTAACCAGG 480 

GAAGCATTAA ACACAGTGCA GCAGCTTTTG CCCAGGCTTC TAAGTTCCTG CCGGCAGCAT 540 

TTATCAATGT AAGAACTAGG ATGCTTCCTG CAGTGGCACT ACCTTCCCCT AGAGCTGGAG 600

CATGCTGCTT GGCCTTAAGC CCCAGCATGA TGAGGCTTCC CTCCTGCCAG GTCAGTAAAA 660 

GTTAGAGAGC TCAGAATTGG GTCTTGCCTG GGTGCAGGTG GCAGGGTTTG CTGAAACCCC 720 

TAAAGAGAAG TCACCAAGGG AGGCAGGTAA TGAATGTTTC CAGAATCAGT CKGATACTCA 780 

TAGCAATTTC TGGCTATCTT TCAAATGTTG AATTTCTGGA TGCTGAGAGG GACTTTGATT 840 

TGATATCATT AAATCCAGGA CAGTCCCAAG AAGTGCTTGG AGTCTCGGCT CTGACAGCCC 900 

AAGAAGGGAA ATAACTTGTA TTAAGGAACA ACTATGAGCC AGGCCCTGAG CTGTCTCTTA 960 

GATAATAAAA CAGATGGGGA GTGGAAGAGT CATTTGCTTC AAGTTATACA GCTAGGAAAT 1020 

ACTCAAGCCA AATCTTGAAC GCAGCTCCCC CTAATTCTGT GGACAGGCAC TTTGTACCAC 1080 

ACACCATGGT CCACCTAAAA ACAGAAGGAT AAAAAGACTT CAGGTTTTCC CACTGTGTGC 1140 

TGACCATCCC AATTTATGAA TCTTCTTCAA AATGACATTT CACAGTTATA GTTAGGGCTC 1200

AGAAATGGCA TTGAGGTAGC CTTATTTCTC CCCTTTAGCA GATGCTTTAA GTACACATTG 1260 

CTGACTTQAG CCCACCCCCA GGAGTTAGGA GAACATTTCC TTTTTCATGC CATCTTCCAT 1320 

AAATAAGGTG TTTCTTGGCC TTCAAAGATA TAGAACTTTG CAGCAGTAGT AAAAGTGAAG 1380 
GGTGTTCTGC TCTCTACTCA ACTTTATTTG AAAATGTCTG CAGCTTCACT CCTGTAGAAA 1440

AGGAAATCTT CATATTTTAG TAAACTTAGC CGCCAGTGTA CTCTGTGAGG ATGTGGCAAT 1500 

TCAAAGTCCA GTGAATCTGG CTCTCTTACT GATTCCTGGT TTTAGTGTGT GTGTCGGGGG 1560 

AGTGTGTACC TATATATAAA GGACAAGTGT GATATGTGTG TATATGTATA TACATACATA 1620 

CATGTCCACA CACACACACA CAATATTTGA GAGCTAAGGA AAACTCAAAG CAGCCXXTTTC 1680 

ATTATCTTGC GTACTACTTC AAAGATTTCT GTCAGCCCTA ATTACAAGTG TCACCATATA 1740 

GTTGGGGCTT AGGTACTTGC TTACAGGAAG AGCAATTCCC TAGCAAAGGT CATTAGCTCC 1800 

TAAGGCACTG AGTCAAAGTG ACAGCCCTGA AGGAAATTGC ACTCCAGCCC TCCTCCAGGA 1860 

TGTCTAATAA GATGGGAAAC TTGGATGCCC AGCCATTTTG GTGACCTGAG AGTCTAACTA 1920 

CTCCAGTTAG ACCTAAGGGC ACAAATGCAG AATTCATGAC CTTGTAGTTG TGGCAGGGTC 1980 

TAGGAAGTCC TCTCTCCCXIA AGTAGAAAAT ATTCTCTTGC CATTCCTGAA ATTCCACATT 2040 

CATATAATGG CTGTGCAATA CATGCTTCTC AATAAGAAAA TTAACTGCAT GTTTACTGTG 2100 

TGCTGATCAC ATCAGATTTT TATGTTTAAA AAAATCTCAT TAlGGOTTGA GTCCAGCCCA 2160 

GCTCTAAGAG AAAAAGAAGG CCCATATGGG AGACTTCAGT CTCATTATTA TTGCCTTTAT 2220 

CCAGCAGTGC TTATRAAGCC CCCTACCCTG TCCCATTCCA GAAACCATAA GACTCAGGCA 2280 

GTrCTTGATT CTGGAGGCCT GCCTGGTAAG ATAAGATAGT ATAATTTGGA ACTGAGAACA 2340 

TACCAGAAAC AGCAGAACGA GGGCCAGAGC AGAAAAATGA AAATAAGTGG AGACACTTAT 2400 

GGATACATTG GTGCAAATAA AGCCACGGGS CCCATACTGG GCTTGATATG ACTTTGAGGG 2460 
GACAGCAGAT TAATACtTAA TGAGGGTTAA ACCTGACCAG TOTTCTACA" Gn^GfiCAi3QCC 2520

ACACTGCATG AATGGGGAGA ACCAATGAAT CCATTGTCCT CTGCCTATTT TCCTGTGCAC 2580 

AGTCACATTC CCTCCTTAGG AATCTTCCCC TTCCACCCTT TACATTAAAC AAGGGAACAC 2640 

TGAATCTTTC AAGGGAATTA CACGTTTGGG TTAATGTTTC AGTATATCAT TTTCATACTG 2700 

TAAATTATTT TGTAAGAGAG ATTTACTGCT ATCCCAGGAT GTTCGGACTT QGTGCCCCTG 2760 

TGCATTTGGA AATCAATAAA CTATTACTGG AAATGCCAAA AAAAAAAAAA AAAAAAAAAN 2820 

NAAAAAACTC GAGQGGGGCC CGTACCC 2847 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

GAGATTGGTG GAGGAGAGTA AATAATCTAG AGGCAAGAGT TCAGTGAGGG CCAAGGGGGA 60 

CCCCCAGAAA AAGCTATGGA GCTAACTCAT CTCTTTTACA AGGGGTGGCC ATGACTTACT 120 

GTTGCAAAGT ACTCAGTGTA TATTTAATGT TGATTGTTGA ATTTTAGTTA CGAGAGGGAA 180 

10 GAACAATTTT ACTTCTGTCC TTATTTCACT TGCTGAAAAG CTGTGGGACA AAATGTATGG 240 

AATAGACAAG GCCACTTTCT TTGTGATTTC TGCTTTTCAT GCATATTATT TTATTTACCC 300 

ATAATTTCCA AGAGGTTTGG CGTTCCGCTC TCCTGCTTTT TTCTTTCATC CACCCCTTTC 360 

CTTTTTTrGG AAGGGGGTTA TATATGAGAG TTCATTGAAG AAGTCCAGTG AGGCTGAAGT 420 

AAAGGGGCAA GATAGGGCAG TTAACTAAAG AGCACTTTAT TTCTTTGAAG CCTTTCTAAG 480 

20 AAAGAAATGG GGGTGCGAGT GGCTTGAATC TCCCATGATG TTGGAGGGCA CTTAGTGGGG 540 

TTGAAGTATG ACATAATATT TCCCATTGGG GAAAGGAGAA TTTCTCTTAG AGGGTGGCAA 600 

AATGCCTTTG CCCAGTGTCC CTATTTTAGG CATCTTTTCC rrCCTTATTC CTTCCAGTCA 660 

GGGTGTGTCC TATACAAAAC TTCCCATCAG TTCTCCTCAA TATTCCCCAT TTGTAAATGA 720 

TCACTTCTCT TTTCTAAACC CTTTTCCTGT TCAGATCCAT ACAGGATTTG CAAGGGTAGG 780 

30 ATCATACATG CAAATGCCCC TTGTTCATCT GTGTCTTCTG CAAACTAGTC TCATGAAGAA 840 

TTCTGGCGTG CAGCAGGGTA GCTGAAGTTT GGGTCTGGGA CTGGAGATTG GCCATTAGGC 900 

NTCNCTGAGA TTCCAGCTCC CTTCCACCAA GCCCAGTCTT GCTACGTGGC ACAGGGCAAA 960 

CCTGACTCCC TTTGGGCCTC AGTTTCCCCT CCCCTTCATG AAATGAAAAG AATACTACTT 1020 

TTTCTTGTTG GTCTAGCATT GCTGGACACA AAGTGTAGTC ATTATTGTTG TATTGGGTGA 1080 

40 TGTGTGCAAA ACTGCAGAAG CTCACTGCCT ATAAGAQGAA ATAAGAGAGA AAGTGGAGGA 1140 

GAGGGACAAA AGGAGTAATT ATTTGGTATA GATCCACCCA TCCCAACCTT TCTCTCCTCA 1200 

GTCCCTGCTC CTCATGTTTC TGGTTTGGTG AGTCCTTTGT GCCACCACCC ATAATGCTTT 1260 

GCATTGCTGC ATCCTGGGAA GGGGGTATAT GGTCTCACAA GTTGTTGTCA TTG yri ' l TTT 1320 

GCATGCTTTC TTAATAAAAA AAAAAAAAAA ATGTTTANAG TTTTATCTTA AAAAAAAAAA 1380 

50 AAAAAAAAAA ACCC 1394 



(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double



WO 98/39448 PCT/US98/04493

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

5 GCMRCGAGGC GGAGTAAAGG GACTTGAGCG AGCCAGTTGC OGGATTATTC TATTTCCCCT 60 

CCCTCTCTCC CGCCCCGTAT CTCTTTTCAC CCTTCTCCCA CCCTCGCTCG CGTACCATGG 120 

CGGAGCGTCG GCGGCCACTC AGTCCCATTC CATCTCCTCG TCGTCCTTCG GAGCCGAGCC 180 

GTCCGCGCCC GGCGGCGGCG GGAGCCCAGG AGCCTGCCCC GCCCTGQGGA CGAAGAGCTG 240 

CAGCTCCTCC TGTGCGGTGC ACGATCTGAT TTTCTGGAGA GATGTGAAGA AGACTGGGTT 300 

15 TGTCTTTGGA CACGCTGATC ATGCTGCTTT CCCTGGCAGC TTTCAGTGTC ATCARTGTGG 360 

GTTTCTTAMC TCATCCTGGC TCTTCTCTCT GTCACCATCA RCTTCAGGAT CTACAAGTCC 420 

GTCATCCAAG CTGTOCAGAA RTCAGAARAA GGCCATCCAW TCCAAAGCCT ACCTQGACGT 480 

AGACATTACT CTGTCCTCAG AAGCTTTCCA TAA1TACATG AATGCTGCCA TGGTGCACAT 540 

CAACAGGGCC CTGAAACTCA TTATTCGTCT CTTTCTGGTA GAAGATCTGG TTGACTCCTT 600 

25 GAAGCTGGCT GTCTTCATGT GGCTGATGAC CTATGTTGGT GCTGTTTTTA ACGGAATCAC 660 

CCTTCTAATT CTTGCTGAAC TGCTCATTTT CAGTGTCCCG ATTGTCTATG AGAAGTACAA 720 

GACCCAGATT GATCACTATG TTGGCATCGC CCGAGATCAG ACCAAGTCAA TTGTTGAAAA 780 

GATCCCAAGC AAAA 794 



(2) INFORMATION FOR SEQ ID NO: 103:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1544 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

TTTGCTTGCT AGTCTGAACC AAAGAGTTGT TTGGGCATTT GCTGTGTTGG CCATTTCTGG 60 

AGCAAGAGGG TCTTCTTCCT CCTTCCCCCA GCCAGCCAGC TGTCCTGGGG CCAGGCTTTC 120 

50 CTGGGTGGAA AGAAGTATAC CTTTCCCTGG QGCCCTAGGA TAGCAAAGTG AGCCATAGTG 180 

GGCCAGGCTG CCCTCCATGC TGGGCCCCAG CCCAGGTCTG CACTCGCCTG GATCACCTTC 240 

TTTGAGCCTT AGCCATCTCC TGTCAGGTAG GAATGAACTT GCCAGCCTTC AGGYTCGTTC 300

AGCTATGACC ATCTGTGCGG TCAGGGTACA CTCAGCTCTC CTCCCCAACT CCAGCAGCCT 360 

TTAAGAAGTG TCCCTTTGGC GCCCCCTGGA GGCAGAGCAC TGAGCTGGAC CCTGGGTAGA 420 

60 CTCCCACAGG GAGGACGGAG CTGGCCTCAG GAGTGGGACA CCCAGACTTG GCAGGGCCTT 480 
CAAGAGGCCT GTGTGGGGGC CCCAGGAATC CTTAGCTGAA GCGGGGAGAC TCACTCTCCA 540

TCTCAGGAAA TTCTAGCCCT TGCCCTCAGG GAGCCACGGT TGAGGGTGAG GCCCAACACC 600 

TGCGTTAGGG CCCTGGGTGG GCAAGTCTGG GCCCTGGGGT AGGGAGQGAG ACTCAGGCCC 660 

ACACTTGGGT ATTTTCTAAT TTCAGACAAA CACACACTCA GCGCCSCACTC ACTGATTCCT 720 

10 ACACATTGCC AAGATTTCAC ACATGTGACC AGGGGCCACC AAAGTCCCTG TGACCTTTGr 780 

GACTAGGATC CTAATTTCTC TATTTTCTCC TGGGTGCXrTG GGTCTGTGTC ACCTGGGGCA 840 

GTCTGGATAA TGTTTAGTTC TGTGACACTG TTTTTTGGGG GTGGCACCTG GTTCTCCGAT 900 

GCCTGGGCTG GTGTCAGGCC CAGGACTGTA GTGCTGGGAG CAGTAAAGCT CAGCTCTGTG 960 

TAATCAGTGA TGCTATGGCT TGCTCGTGTC TTATGATCCA ATCCTTTTCT ACATCAGCCC 1020 

20 TTCTTTTGTT TTATGGCTAG TCTTATCTGG CXTTGCTTATT TCCTTGCGGG GAGGAGAGGG 1080 

TTTGCTAATC TGCTCCCAGC CCAACCTATT ACCACCCCAC CTCGCTGCX3A CCTACTGCTC 1140 

GGGAGGCAGC AGACAGGGAG CCACX:AGCAG TGGCTTCCTG GCCCTGTGCT GGGGGTGGGG 1200 

GGAAGCTGGG GGCACATGTG GCCCTTGCCT TCTGAGCAGC TCCCAGTGCC AGGGCTTTGA 1260 

GACTTTCCCA CATGATAAAA GAAAAGGGAG GTACAGAAGT TCCAATTCCC TTTTTATTTT 1320 

30 GCTGGTTGGT ATCTGTAAAT GTTTAATAAA TATCTGAGCA TGTATCTATC AACGCCAAGA 1380 

ATTTCAAAGT CTCCTTCAAC AATATGAGGC TTTTAGGATG TTTATATTCC TTCATCCXTC 1440 

TTCTTTCCCA GGTTTTGCAG GGAAAAAAAG TCTGGAATTA TAGATACAGC 1TATTATTAA 1500 

ATTTGTTCTT GCATAAAAAA AAAAAAAAAA AAC3«GNNGGG GGGG 1544



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

ACCCACGCGT CCGNCTTGTC CACCCGGGGG CGTGGGAGTG AGGTACCAGA TTCAGCCCAT 60 

TTCGCCCCGA CGCCTCTGTT CTCGGAATCC GGGTGCTGCG GATTGAGGTC CCGGTTCCTA 120 

55 AGGTGGGTCG CTGTCCACCC GGGGGCGTGG GAGTGAGGTA CCAGATTCAG CCCATTTGGC 180 

CCCGACGCCT CTGTTCTCGG AATCCGGGTG CTGCGGATTG AGGTCCCGGT TCCTAACGGA 240 

CTCCAAGATG GAGGAAGGCG GGAACCTAGG AGGCCTGATT AAGATGGTCC ATCTACTGGT 300 
CTTGTCAGGT GCCTGGGGCA TGCAAATGTG GGTGACCTTC GTCTCAGGCT TTCCTGCTTT 360

TCCGAAGCCT TCCCCGACAT ACCTTCGGAC TAGTGCAGAG CAAACTCTTC CCCTTCTACT 420 

'5 TCCACATCTC CATGGGCTGT GCCTTCATCA ACCTCTGCAT CTTGGCTTCA CAGCATGCTT 480 

GGGCTCAGCT CACATTCTGG GAGGCCAGCC AGCTTTACCT GCTGTTCCTG AGCCTTACGC 540 

TGGCCACTGT CAACGCCCGC TGGCTGGAAC CCCGCACXIAC AGCTGCCATG TGGGCCCTGC 600 

AAACCGTGGG AGAAGGAGCG AGGCCTGGGT GGGGAGGTAC CAGGCAGCCA ACAGGTTCCC 660 

GATCCTTAAC GCCAGNTGCG AGAGAAGGAC CCCAAGTACA GTGCTCTCCG CCAGAATTTC 720 

15 TTCCGCTACC ATGGGCTGTC CTCTCTTTGC AATCTGGGCT GCGTCCTGAG CAATGGGCTC 780 

TGTCTCGCTG GCCTTGCCCT GGAAATAAGG AGCCTCTAGC ATQGGCCCTG CATGCTAATA 840 

AATGCTTCTT CAGAAAAAAA AAAAAAAAAA A 871 

(2) INFORMATION FOR SEQ ID NO: 105: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

GGCACGAGTT ATAGCATGGC ATTCATACTT TTGITTTATT GCCTCATGAC TTTTTTGAGT 60 

TTAGAACAAA ACAGTGCAAC CGTAGAGCCT TCTTCCCATG AAATTTTGCA TCTGCTCCAA 120

AACTGCTTTG AGTTACTCAG AACTTCAACC TCCCAATGCA CTGAAGGCAT TCCTTGTCAA 180 

40 AGATACCAGA ATGGGTTACA CATTTAACCT GGCAAACATT GAAGAACTCT TAATGTTTTC 240 

TTTTTAATAA GAATGACGCC CCACTTTGGG GACTAAAATT GTGCTATTGC CGAGAAGCAG 300 

TCTAAAATTT ATTTTTTTAA AAAGAGAAAC TGCCCCATTA TTTTGGTGGG GTTGGTTTTT 360 

AATTTNTAAT NTGAAAAATT TTTTTGGGGT TTTTGGGGCC ATGG 404 



(2) INFORMATION FOR SEQ ID NO: 106: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

GTCAGACAGG TGGAC3CCGCC GGGGC7VGGAG TCTCAAAGAG CCAGGCTCCA GGAGAGGAAG 60

GGCTCTRCGA GAGGAGAGAG GAGAGCGCTG GAGAGGAGAG GCTGGAGAGT OCTTAGCCAG 120 

' 5 GATGGAGGCT GTTGTGAACT TGTACCAAGA GGTGATGAAG CACGCAGATC CCCGGATCCA 180 

GGGCTACCCT CTGATGGGGT CCCCCTTGCT AATGACCTCC ATTCTCCTGA CCTACGTGTA 240 

CTTCGTTCTC TCACTTGGGC CTCGCATCAT GGCTAATCGG AAGCCCTTCC AGCTCCXTrGG 300 

CTTCATGATT GTCTACAACT TCTCACTGGT GGCACTCTCC CTCTACATTG TCTATGAGTT 360 

CCTGATGTCG GGCTGGCTGA GCACCTATAC CTGGCGCTGT GACCCTGTGG ACTATTCCAA 420 

15 CAGCCCTGAG GCACTTAGGA TGGTTCGGGT GGCCTGGCTC TTCCTCTTCT CCAAGTTCAT 480 

TGAGCTGATG GACACAGTGA TCTTTATTCT CCGAAAGAAA GACGGGCAGG TGACCTTCCT 540 

ACATGTCTTC CATCACTCTG TGCTTCCXTTG GAGCTGGTGG TGGGGGGTAA AGATTGCCCC 600 

GGGAGGAATG GGCTCTTTCC ATGCCATGAT AAACTCTTCC GTGCATGTCA TAATGTACXTT 660 

GTACTACGGA TTATCTGCCT TTGGCCCTGT GGCACAACCC TACCTTTGGT GGAAAAAGCA 720 

25 CATGACAGCC ATTCAGCTGA TCCAGTTTGT CCTGGTCTCA CTGCACATCT CCCAGTACTA 780 

CTTTATGTCC AGCTGTAACT ACCAGTACCC AGTCATTATT CACCTCATCT GGATGTATGG 840 

CACCATCTTC TTCATGCTGT TCTCCAACTT CTGGTATCAC TCTTATACCA AGGGCAAGCG 900 

GCTGCCCCGT GCACTTCAGC AAAATGGAGC TCCAGGTATT GCCAAGGTCA AGGCCAACTG 960 

AGAAGCATGG CCTAGATAGG CGCCCACCTA AGTGCCTCAG GACTGCACCT TAGGGCAGTG 1020 

35 TCCGTCAGTG CCCTCTCCAC CTACACCTGT GACCAAGGCT TATGTGGTCA GGACTGAGCA 1080 

GGGGACTGGC CCTCCCCTCC CCACAGCTGC TCTACAGGGA CCACGGCTTT GGTTCCTCAC 1140 

CCACTTCCCC CGGGCAGCTC CAGGGATGTG GCCTCATTGC TGTCTGCX:AC TCCAGAGCTG 1200 

GGGGCTAAAA GGGCTGTACA GTTATTTCCC CCTCCCTGCC TTAAAACTTG GGAGAGGAGC 1260 

ACTCAGGGCT GGCCCCACAA AGGGTCTCGT GGCCTTTTTC CTCACACAGA AGAGGTCAGC 1320 

45 AATAATGTCA CTGTGGACCC AGTCTCACTC CTCCACCCCA CACACTGAAG CAGTAGCTTC 1380 

TGGGCCAAAG GTCAGGGTGG GCXXX3GGCCT GGGAATACAG CCTGTGGAGG CTGCTTACTC 1440 

AACTTGTGTC TTAATTAAAA GTGACAGAGG AAACCANAAA AAAAAAAAAA AAAAACTCGA 1500 

GGGGGGCCCG TACCCAAATC GCCGGTATGA TCGTAAACAA TC 1542 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2327 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

GGTAGCTCAN TGCAGTGAAA TAGTCTTACT GGAAACAAAG CCCTTTATCA AGAATAATTA 60 

ACTCTTCCCT TTTCTTTTTG GAGAGGTGCT TTGTTTCTGA TCGGACCATT TCACTGCAGC 120 

AAGCAACACA GTATTCTRAG CAGAAGATCG GGACTTGAGG CCATGTTGCG GAGGGCCAGT 180 

RACATTATCT GGACTCTGGA GTGTGAGGAA TATGGACTCC ACTCTTCACT ATATTCACAR 240 

CGATTCAGAC TTGAGCAACA ATAGCAGTTT TAGCCCTGAT GAGGAAAGGA GAACTAAAGT 300 

ACAAGATGTT GTACCTCAGG CGTTGTTAGA TCAGTATTTA TCTATGACTG ACCCTTCTCG 360 

TGCACAGACG GTTGACACTG AAATTGCTAA GCACTGTGCA TATAGCCTCC CTGGTGTGGC 420 

CTTGACACTC GGAAGACAGA ATTGGCACTG CCTGAGAGAG ACGTATGRGA CTYTGGCCTC 480 

AGACATGCAG TGGAAAGTTC GACGGAACTC TAGCATTCTC CATCCACGRG CTTGCAGTTA 540 

TTCTTGGAGA TCAATTGACA GCTGCAGATC TGGTTCCAAT TTTTAATGGA TTTTTAAAAG 600 

ACCTCGATGA AGTCAGGATA GGTGTTCTTA AACACTTGCA TGATTTTCTG AAGCTTCTTC 660 

ATATTGACAA AAGAAGAGAA TATCTTTATC AACTTCAGGA GTTTTTGGTG ACAGATAATA 720 

GTAGAAATTG GCGGTTTCGA GCTGAACTGG CTGAACAGCT GATTTTACTT CTAGAGTTAT 780 

ATAGTCCCAG AGATGTTTAT GACTATTTAC GTCCCATTGC TCTGAATCTG TGTGCAGACA 840 

AAGTTTCTTC TGTTCGTTGG ATTTCCTACA AGTTGGTCAG CGAGATGGTG AAGAAGCTGC 900 

ACGCGGCAAC ACCACCAACG TTCGGAGTGG ACCTCATCAA TGAGCTTGTG GAGAACTTTG 960 

GCAGATGTCC CAAGTGGTCT GGTCGGCAAG CCTTTGTCTT TGTCTGCCAG ACTGTCATTG 1020 

AGGATGACTG CCTTCCCATG GACCAGTTTG CTGTGCATCT CATGCCGCAT CTGCTAACCT 1080 

TAGCAAATGA CAGGGTTCCT AACGTGCGAG TGCTGCTTGC AAAGACATTA AGACAT^ACTC 1140 

TACTAGAAAA AGACTATTTC TTGGCCTCTG CCAGCTGCCA CCAGGAGGCT GTGGAGCAGA 1200 

CCATCATGGC TCTTCAGATG GACCGTGACA GCGATGTCAA GTATTTTGCA AGCATCCACC 1260 

CTGCCAGTAC CAAAATCTCC GAAGATGCCA TGAGCACAGC GTCCTCAACC TACTAGAAGG 1320 

CTTGAATCTC GGTGTCrTTC CTGCTTCCAT GAGAGCOGAG GTTCAGTGGG CATTCGCCAC 1380 

GCATGTGACC TGGGATAGCT TTCGGGGGAG GAGAGACCTT CCTCTCCTGC GGACTTCATT 1440 

GCAGGTGCAA GTTGCCTACA CCCAATACCA GGGATTTCAA GAGTCAAGAG AAAGTACAGT 1500 

AAACACTATT ATCTTATCTT GACTTTAAKG KKWAWKMMWW KCTCACaiSRA TTATAMTTSW 1560 

CWMMRARGSM WYMAAWSCTK SWC3CTCYWCC KSRSTGPMKG MMRCTCTAGA AYTRGYRGAK 1620 

CMVYYKSGCT KMWGGAAKKS GGCASGAGCC AGAGACCTGC ATTGCTTTCT CCTGGTTTTA 1680 
TTTAACAATC GACAAATGAA ATTCTTACAG CCTGAAGGCA GACGTGTGCC CAGATGTGAA 1740

AGAGACCTTC AGTATCAGCC CTAACTCTTC TCTCCCAGGA AGGACTTGCT GGGCTCTGTG 1800 

GCCAGCTGTC CAGCCCAGCC CTGTGTGTGA ATCGTTTGTG ACGTGTGCAA ATGGGAAAGG 1860 

AGGGGTmr ACATCrcCTA AAGGACCTGA TGCCAACACA AGTAGGATTG ACTTAAACTC 1920 

10 TTAAGCGCAG CATATTGCTG TACACATTTA CAGAATGGTT GCTGAGTGTC TGTGTCTGAT 1980 

TTTTTCATGC TGGTCATGAC CTGAAGGAAA TTTATTAGAC GTATAATGTA TGTCTGGTGT 2040 

TTTTAACTTG ATCATGATCA GCTCTGAGGT GGAACTTCTT CACATACTGT ACATACCTGT 2100 

GACCACTCTT GGGAGTGCTG CAGTCTTTAA TCATGCTGTT TAAACTGTTG TGGCACAAGT 2160 

TCTCTTGTCC AAATAAAATT TATTAATAAG ATCTATAGAG AGAGATATAT ACACmTGA 2220 

20 TTGTTTTCTA GATGTCTACC AATAAATGCA ATTTGTGACC TGTAAAAAAA AAAWAAAAAA 2280 

ACTCGAGGGG GGCCCGGTAC CCAAATCGCC GATATGATCT AANCATC 2327 



(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1062 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:



GGCCGCCGAG GCGCAACAGC CGTTCTGTCA GCTCTGGGTC CAACCGGACT AGCGAANATC 60 

TTCCTCATCC TCATCATCGT CTTCCTCATC CCGATCTCGG TCCAGGTCCC TCTCCCCCCC 120 

ACACAAGAGG TGGCGAAGGT CCAGCTGTAG TTCCTCTGGA CGTTCTCGAA GATGCTCTTC 180 

CTCTTCTTCG TCATCATCTT CCTCTTCGTC TTCCTCATCC TCATCATCCA GnTTCTCGAAG 240 

CCGCTCACGA ATCCCCATCC CCCCGCCGGA GRAAGTGACA GGAGGCQGCG GTACAGCTCT 300

TATCGTTCAC ATGACCATTA CCAAAGGCAA AGAGTGCTAC AAAAGGAGCG TGCAATAGAA 360 

GAAAGAAGGG TGGTCTTCAT TGGAAAGATA CCTGGCCGCA TGACTCGATC AGAGCTGAAA 420 

CAGAGGTTCT CCGTTTTTGG AGAGATTGAG GAGTGCACCA TCCACTTCCG TGTCCAAGGG 480 

GACAACTACG GCTTCGTCAC TTATCGCTAT GCTGAGGAGG CATTTGCAGC CATTGAGAGT 540 

GGCCACAAGC TGCGGCAGGC AGATGAGCAG CCCTTTGATC TCTGCTTTGG GGGCCGAAGG 600

SWGTNCTGCA AGAGGAGCTA TTCTGATCTT GACTCCAACC GGGAAGACTT TGACCCAGCA 660 

CCTGTAAAGA GCAAATTTGA TTCTCTTGAC TTTGACACAT TGTTGAAACA GGCCCAGAAG 720 

AACCTCAGGA GGTAACCTTG GGCCCTTCCC TGCTATCCTT TTTCTCCTTT GGAGGTGCCC 780

AACCTCCTCC ACCCCCTTCC CCTACTCTAG GGGAGAGAGC TGCTAGTGAG ATGACTGTTT 840 

TATAAAGAAA TGGAAAAAAG TGAAATAAAA AATATGTTGA ATCAGATTTT TTAAAAGGGG 900 

TATTTGTTTT TTTATAACAG GTATTGAAAC AAGTTAACTT GCATTCCTAT GTAAGATAGG 960 

AGGGGCTGAG GGGATCCCCA GTGTTTGGAA CATAAGTCAC TATGCAGACT AATAAACATC 1020 

AACTAGAGAG NAAAAAAAAA AAAAAAAAAA ATTTAAAAAA CT 1062 
(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

GAGAGACTCA CACTTCITTT CCATTATCAC OXSACGATGTA GTGGACATAG CAGGGGAAGA 60 

GCACCTACCT GTGTOGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 

AGGCTTCCTG CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 180 

AACTGAGAAG TGGGGATTAA ATATGGAGTA TTGTCGTGGC CAGGCTTACA TTGWCTCTAG 240 

TGGATTTTCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TYAA05KMRA TOKCCCCMAK 300 

YWAWCKGAAC AMAMKCTGSW CYTCCWSYGC SKTRRMKRYC GYKSTATRRC WARWKSAKYM 360 

CCYGKK^r^GS RRGTAWYTSK TGCAYKAGGG AACAATTGAG GAAGTTTGTT CTTTTTTCCA 420 

TCGATCACCA CAACTGCTTT TAGAACTTGA CAACGTAATT TCTGTTCTTT TTCAGAACAG 480 

TAAAGAAAGG GGTAAAGAAC TGAAGGAAAT CTGCCATTCT CAGTGGACAG GCAGGCATGA 540 

TGCTTTTGAA ATTTTAGTGG AACTCCTGCA AGCACTTGTT OTATGTTTAG ATGGTATAAA 600 

TAGTGACACA AATATTAGAT GGAATAACTA TATAGCTGGC CGAGCATTTG T/^CTCTGAGT 660 

GCAGTGTCAG ATTTTGATTT CATTGTTACT ATTGTTGTTC TTAAAAATGT CCTATCTTTT 720 

ACAAGAGCCT TTGGGAAAAA CYYCMAGGGG CAAACCTCTG ATGTCTTCTT TGCKKMMSRT 780 

ARmTTTGAY ATRMARYACT RMtfTKSAYTY AAYGRWGTGA CWSGAWAATA TTRAASTYTA 840 

TACAATKAAT YWTRRYTTSM KRMACaiYAAT CCGAAAYTGT GGMAAMYAAA CTTGATATTC 900 

AAATGAAACT CCCTGGGAAA TTCCGCAGAG CTCACCAGGG TAACTTGGAA TCTCAGCTAA 960 

CCTCTGAGAG TTACTATAAA GAAACCCTAA GTGTCCCAAC AGTGGAGCAC ATTATTCAGG 1020 

AACTTAAAGA TATATTCTCA GAACAGCACC TCAAAGCTCT TAAATGCTTA TCTCTGGTAC 1080 
CCTCAGTCAT GGGACAACTC AAATTCAATA CX3TCGGAGGA ACACCATGCT GACATGTATA 1140

GAAGTGACTT ACCCAATCCT GACACGCTGT CAGCTGAGCT TCATTGTTGG AGAATCAAAT 1200 

GGAAACACAG GGGGAAAGAT ATAGAGCTTC CGTCCACCAT CTATGAAGCC CTCCACCTGC 1260 

CTGACATCAA G T T I TTTCCT AATGTGTATG CATTGCTGAA GGTCCTGTGT ATTCTTCCTG 1320 

TGATGAAGGT TGAGAATGAG CGGTATGAAA ATGGACGAAA GCGTCTTAAA GCATATTTGA 1380

GGAACACTTT GACAGACCAA AGGTCAAGTA ACTTGGCTTT GCTTAACATA AATTTTGATA 1440 

TAAAACACGA CCTGGATTTA ATGGTGGACA CATATATTAA ACTCTATACR AKTAMC3TCAG 1500 

MC3CTVYCTAC AKAYRAVTCM SWAVMTGTGG AAARYWSSTA MGMSWGCWKK TAMMRRIMCG 1560 

GMWWTYYYMK RKIYGAYMYW YGCGMMCGAG AAAAAGCCGT AAGGTGTATG TAGACCACTT 1620 

AATCACTAAA TATCTTTGCC TATAGGACTC CATTGAATAC ATTAGCCATT GATAATCTAC 1680

CTGTTTAAAT GGCCCCTGTT TGAACTCTCA AGCTTTC5AAG ACCTACCTGT TCTTCCAGAA 1740 

GAGAACGTTG AAAGTGCX:AT GTTTCCTTTT GCGTGATCTC TGTTGATGGC ACTCTGGAAT 1800 

TGTTTCCAGT TTAAKTCATT TTAGACATAG CATTTATTAT CAOTGTGGAT CTCTACTTGT 1860 

TGGGTGTTAT GAATTCTTTG AAGAATATAT TTTGAAGAGG TGTGGGAGGA AGGAATACAT 1920 

TTTATAAAAT GTTGTAGTGA AGCCCACAAT TGACCTTKGA CTAATAGGAG TTTTAAGTAT 1980

GTTAAAAATC TATACTGGAC AGTTACAAGA AATTACCGGA GAAAAGCTTG TGAGCTCACC 2040 

AAACAAGGAT TTCAGTGTAG ATTTTGTCTT TCTTGAACTT AAAGAAACAA ATGACAAAGT 2100 

TTGAATGGAA AAGCCTGCTG-TTGTTCGAGA TC^ GTTTGTGGAG 2160

CCTACATCTT CCTAAGCTTT TTAGCAGGTA TATGTTGAAC ACTTCTGTTT CATGGTTGAG 2220 

ACAGAATCAG AGGCCATGGA TACTGACAAC TGATTTGTCT GTTTnTTTC TCTGTCTTTT 2280

TCCATGACTC TTATATACTG CCTCATCTTG ATTTATAACX: AAAACCTQGA AAACCTACAA 2340 

AATAAGTGTT GTGGTTTATC TAGAAAAATA TGGAAAATAT TGCTGTTATT TTTGOTGAAG 2400 

AAAATCAATT TTGTATAGTT TATTTCAATC TAAATAAAAT GTGAATTTTG TTWWATTAAA 2460 

AATTWGGSAC AAABTBOTGG GGGOTCCAAA CHTWVTCGHG KAAMTTCTCT WAARMATYTK 2520 

aItaaacmsct TCACAATTC 2539



(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

AGCATGAAGC CGATGGCCGT GGTGGCCAGT ACCGTCCTGG GCCTGGTGCA AAACATGCGT 60 

GCGTTTGGCG GGATCCTGGT GGTGGTCTAC TACGTATTTG CCATCATTGG GATCAACTTG 120 

TTTAGAGGCG TCATTGTGGC TCTTCCTGGA AACAGCAGCC TGGCCCCTGC CAATGGCTCG 180 

GCGCCCTGTG GGAGCTTCGA GCAGCTGGAG TACTGGGCCA ACAACTTCGA TGACTTTGCG 240 

GCTGCCCTGG TCACTCTGTG GAACTTGATG GTGGTGAACA ACTGGCAGGT GTTTCTGGAT 300 

GCATATCGGC GCTACTCAGG CCCGTGGTCC AAGATCTATT TTGTATTGTG GTGGCTGGTG 360 

TCGTCTGTCA TCTGGGTCAA CCTGTTTCTG GCCCTGATTC TGGAGAACTT CCTTCACAAG 420 

TGGGACCCCC GCAGCCACCT GCAGCCCCTT GCTGGGACCC CAGAGGCCAC CTACCAGATG 480 

ACTGTGGAGC TCCTGTTCAG GGATATTCTG GAGGAGCCCG GGGAC3GATGA GCTCACAGAG 540 

AGGCTGAGCC AGCACCCGCA CCTGTGGCTG TGCAGGTGAC GTCCGGGCTG CCATCCCAGC 600 

AGGGGCGGCA GGAGAGAGAG GCTGGCCTAA CACAGGTGCC CATCATGGAA GAGGCGGCCA 660 

TGCTGTGGCC AGCCAGGCAG GAAGAGACCT TTCCTCTGAC GGACCACTAA GCTGGGGACA 720 

GGAACCAAGT CCTTTGCGTG TGGCCCAACA ACGATCTACA GAACAGCTGC TGGTGCTTCA 780 

GGGAGGCGCC GTGCCCTCCG CTTTCTTTTA TAGCTGCTTC AGTGAGAATT CCCTCGTCGA 840 

CTCCACAGGG ACdTTCAGA CAAAAATGCA AGAAGCAGCG GCCTCCCCTG TCCCCTGCAG 900 

CTTCGGTGGT GCCTTTGCTG CCGGCAGCCC TTGGGGACCA CAGGCCTGAC CAGGGCCTGC 960 
ACAGGTTAAC CGTGAGTCTG TCTCATCTAT t^CACSOTSG 1020

ATTTTAGCCC AGCACCACAG GGTACGTTCC AGTTTTTCTC TCTTTCCATA GCTGTAAGGC 1080 

CCTTTCTGGG AATGGTTCTC ATTCTCCTTA ATCTATTATT GGGTCAGTTT TCCTGCATGT 1140 

CCCCAGCCTC CCATCACTGC CACCCACTCC CCACAGAGAT GCCCTGCTCA TCCGACTGGG 1200 

GCTTTGACTC CCACACTGTG TACCCCTCTT GTGTGGACGC CCTGCTGCCA AAACCTTCAG 1260 

CAAACAGCTT TCCAAATGGA AGTTGTCACT GTCAGGCCTT TACAATCAGC AACAGCAAAA 1320 

TCTACATGCT GCTGAGQGTC CTGCCTCATT AAGATGCAAT AAATATGTAA GTACATAAAA 1380 

ACAGCAATAG AAGAAACGTA ATGCTTTATT CTCAAATATG ATGTCTACAT AGAAAAGCCA 1440 

AAATTATTAA GAATAGTAAG AATTCACCCA GCACTTTGGG AGGCCGAGGC GGGTGGATCA 1500 

TGAGGTCAGG AGATCGAGAC CATCCTGGCT AACAGGGTGA AACCCCGTCT CTACTAAAAA 1560 

TACAAAAAAT TGGCCGGGCG CAGTGGCGGG CGCCTGTGGT CCCAGCTACT GGGGAGGCTG 1620 

AGGCAGGAGA ATGGCGTGAA CCCGGGAAGC GGAGCTTGCA GTGAGCCGAG ATTGCGCCAC 1680 
TGCAGTCCGC AGTCCAGCCT GGGCGACAGA GCGAGACTCC GTCTCAAAAA AAAAAAAAAA 1740
AAAAAAAAAA A 1751
(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

AATGTTGTGG TGGTAGCATT TGGGTTAATT CTRATTATAG AGTCTCTTGG AGAGCAATGT 60 

CCATAAACTA ATCCCAAACA ACATTGTCTT TTTRATGTTG TAGTGAACAG CAGAGAATTT 120 

CAAAGGACCT TGCTAATATC TGTAAGACGG CAGCTACAGC AGGCATCATT GGCTGGGTGT 180 

ATGGGGGAAT ACCAGCTTTT ATTCATGCTA AACAACAATA CATTGAGCAG AGCCAGGCAG 240 

AAATTTATCA TAACCGGTTT GATGCTGTGC AATCTGCACA TCGTGCTGCC ACAOGAGGCT 300 

TCATTCGTTA TGGCTGGCGC TGGGGTTGGA GAACTGCAGT GTTTGTGACT ATATTCAACA 360 

CAGTGAACAC TAGTCTGAAT GTATACCGAA ATAAAGATGC CTTAAGCCAT TTTGTAATTG 420 

CAGGAGCTGT CACGGGAAGT CTTTTTAGGA TAAACGTAGG CCTGCGTGGC CTGGTGGCTG 480 

GTGGCATAAT TGGAGCCTTG CTGGGCACTC CTGTAGGAGG CCTGCTGATG GCATTTCAGA 540 

AGTACTCTGG TGAGACTGTT CAGGAAAGAA AACAGAAGGA TCGAAAGGCA CTCGATGAGC 600

TAAAACTGGA AGAGTGGAAA GGCAGACTAC AAGTTACTGA GCACCTCCCT GAGAAAATTG 660 

AAAGTAGTTT ACAGGAAGAT GAACCTGAGA ATGATGCT7WV GAAAATTGAA GCACTGCTAA 720 

ACCTTCCTAG AAACCCTTCA GTAATAGATA AACAAGACAA GGACTGAAAG TGCTCTGAAC 780 

TTGAAACTCA CTQGAGAGCT GAAGGGAGCT GCCATGTCCG ATGAATGCCA ACAGACAGGC 840 

CACTCTTTGG TCAGCCTGCT GACAAATTTA AGTGCTGGTA CCTGTGGTGG CAGTGGCTTG 900 

CTCTTGTCTT TTTCTTTTCT TTTTAACTAA GAATGGGGCT GTTGTACTCT CACTTTACTT 960 

ATCCTTAAAT TTAAATACAT ACTTATGTTT GTATTAATCT ATCAATATAT GCATACATGA 1020 

ATATATCCAC CCACCTAGAT TTTAAGCAGT AAATAAAACA TTTCGCAAAA GATTAAAGTT 1080 

GAATTTTACA GTTAAAAAAA AAAAAAAAAA AAAAAAA 1117 



(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

GGCAGAGGTT TTCTTATATT TTAAGTAAAT TTAAAGTGGC TATCAGAATA TTTATTCTTG 60 

TTTGAGACTA CCAACATAAC TACGTGTTGA AGGTGCTTCA CAGAGAATAT ATTGCCTTTA 120 

ATGTGAAATA ATTTTCACCA ATGTTGCTAA CTTTAATAAA GTATAAAATT TGTAGAATAT 180 

TCAGTTAAGT AGTTGGTAAC CCTTTTCTAT TTTAGTAAAA CTTAATGCAT GTTTACTTTT 240

TTTTGAAAGA TGCAGACAAT CTCTTTGAAC ATGAATTGGG GGCTCTCAAT ATGGCTGCAT 300 

TACTACGAAA AGAAGAAAGA GCAAGTCTTC TTAGTAATCT TGGCCCATGT TGTAAGGCGT 360 

TGTGCTTCAG ACGGGATTCT GCAATTCGAA AGCAGCTTGT TAAAAATGAG AAGGGCACCA 420

TAAAACAAGC TTACACGAGT GCTCCAATGG TAGACAATGA ATTACTTCGA TTGAGTCTTC 480 

GGTTATTTAA GCGGAAGACT ACTTGCCATG CTCCAGGACA TGAAAAGACT GAAGATAATA 540

AACTTTCACA GTCCAGTATC CAACAGGAAC TGTGTGTGTC TTAAGACCGA AGTTACAATA 600 

TGGTATTTTT GGTACTGTCT TCCTTCAGCA GTGCATATTC TTTTGCAAAG TT C TTTGGTT 660 

TGACAAGCAT TAGTGACAAA GGCAGAAAAG ATTTATCAGC CATGCTAAAA GAGTGAAGAA 720 

TTTTGATCTT TAGAGACACT AGTTTTGGCC AACTTAAGAT TTTACGTTAA TTTTTACATA 780 

GTATTTGACA CTCATGCAAA ATAATGTGAA AACATCTAGA TTTAGTAGTT TATTCTGCGC 840

CTTTTGTTAA AACTGAAGAT TTTGGAAAAT GGTTGTCACT GCTCTTCCAG CCTATGAATA 900

TTTTTGTGAA ATGGAACCAT GGATTTATGT CTGGATCATC CATACAGAAC CAACAATTTT 960 

ATTCAAAAAC AATGTGTTCA TCAAAGTAAT TGCTCACATT GTGCAGTACT ATGTTGTACA 1020 

GACCACGTGA AAGGGAATGC TGGTCTAGCT QGCGTGGTAT GTTTATAGGC GAATTTCAGC 1080 

AGAAGGAAGC CAAAATAGTT TTTTCCTTTT GAAAGTTTTT TAAAAATTAT TTCATGGGTC 1140

TrriTTTTAA TTAATATGTG TGCATTGTTA CAATGTATGT TGGGATGTCT TTTGACCCTA 1200 

AATGCTTTTT TTGTTATCAG AGATTGTGTA CTATTTTTAT TTTTAATAAA TGTATCTTCC 1260 

CTTTTMAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1313 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

ACAGGGACAG AATACTTTCT TTCCTTCCTT CAAGTACAAG AAGGCTTTCT CTACCATTTG 60 

CGTCTACACT TTATTTTAAA AGCTATCCTT TTCTAGTAGT ATTTTATCAT GGCAATGGCA 120 

TGATGACAAC AACAGTCTTT CATTACAGAC TGAAGQGAAG CATGTCCTTA CTTAAAATAG 180 

TTCTGCTACT TTCCCTCCTA TTATAAGGAA ATTTTACAGA TTCTAAAAAT ACCTTAATTT 240 

TTCTTTGATT TTTATTTTAC CAAGTCACAA ATGTCTTTTT GATGTTTTGA GAATTGTTCT 300 

CATAGAATCA CAAATACTGA CATTTCATTA GATGATTATT TTCCTAGAAT CCCCAAAGAG 360 

CAGTGGCAGT CCATGGCTTG GITGAAGCTA GAAATTTTCC TGCCCCTGGT GACCTGGTAA 420 

GCCTCCTGCT CGGAACCGTG TGAGTGGGTG AGGAAGATGA GAGATGGTCA GATGGAAGAG 480 

AGRAATACAT GAACTGCTCT GGCCTCTCTG GTTCTGTTCT TGGCCCAGAG TTTTTGAAAA 540 

GCAGCGGANA TNGACTGACT TCACATGCTC AGCITTCTCA GCdTTTGTT TATTTTGTTG 600 

TCCTTAGATT TCCCTGTTGT AAAAGGGGCA AGAAAAGTAA CTCATCATCT CTAACACACC 660 

ATGGCAGCTT AGCCAGGTAG TCTTAGTGGT GGTGTTTAGG CATAAGATAT GCTGATCATC 720 

AGTCTCAGGC CACAGTTTCC TTCACTAATC GTCCAGCTTG AGTGTTCTGT TCTCTTCCTG 780 

CCCATTTCCT TGAACCTCCT GCTCTAGCCT TGGCGGAGGG AGAGTGCTAT TTGCTTTTGT 840 

TCTCCCTCTG TCTTAGGAAA AGCCATCTTT AATATAGTTC TTCACCACTG TTGGGGTTGT 900 

TITCTCATIT TTTTTTTCTT CCGAAGAACT CCTGGTTGTT ATTGGMTTT GTM^TTAM' 960 

ACAAATTATT GAATTTTATA AGCTTGTACA CAATATTTAA TTAGTGTGAA AGGAAACAAA 1020 

GAATGCAGGA AAAATAATTT AATATCAACC TCAGTTGACA AGGTGCTCAG ATTATTCAAT 1080 

TCGGGATCCT CCTTTTGTTA GGTTTTTGAG ACAACCCTAG ACCTAAACTG TGTCACAGAC 1140 

TTCTGAATGT TTAGGC7VGTG CTAGTAATTT CCTCGTAATG ATTCTGTTAT TACTTTCCTA 1200 

TTCTTTATTC CTCTTTCTTC TGAAGATTAA TGAAGTTGAA AATTGAGGTG GATAAATACA 1260 

AAAAGGTAGT GTGATAGTAT AAGTATCTAA GTGCAGATGA AAGTGTGTTA TATACATCCA 1320 

TTCAAAATTA TGCAAGTTAG TAATTACTCA GGGTTAACTA AATTACTTTA ATATGCTGTT 1380 

GAAYCTACTC TGTTCCTTGG CTAGAAAAAA TTATAAACAG GACTTTCTAG TTTGGGAAGC 1440 

CAAATTGATA ATA1TCTATG TTCTAAAAGT TGGGCTATAC ATAAATTATT AAGAAATATG 1500 

GATTTTTATT CCCAGGATAT GGTGTTCATT TTATGATATT ACGCAGGATG ATGTATTGAG 1560 

TAAAATCAGT TTTGTAAATA TGTAAATATG TCATAAATAA ACAATGCTTT GACTTATTTC 1620 

CAAAAAAAAA AAAAAATAAA NTTCGAGGGG GGGC 1654 
(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60

GGGTTGCCa^C GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180 

CTGGMCCTCA TCTTCCTGCG CCGACCTGCG CGGGGTAAGG GGWAGTTTCA GACTGTGAAG 240 

GACGTCGTGC TGGACTGCCT GTTGGACTTC TTACCCGAGG GGGTGAACAA AGAGAAGATC 300

ACACCACTCA CGCTCAAGGA AGCTTATGTG CAGAAAATGG TTAAAGTGTG CAATGACTCT 360

GACCGATGGA GTCTTATATC CCTGTCAAAC AACAGTGGCA AAAATGTGGA ACTGAAATTT 420 

GTGGATTCCC TCCGGAGGCA GTITGAATTC AGTCTAGATT CTTTTCAAAT CAAATTAGAC 480 

TCTCTTCTGC TCTTTTATGA ATGTTCAGAG AACCCAATGA CTGAGACATT TCACCCCACA 540 

ATAATCGGGG AGAGCGTCTA TGGCGATTTC CAGGAAGCCT TTGATCACCT TTGTAACAAG 600 

ATCATTGCCA CCAGGAACCC AGAGGAAATC CGAGGGGGAG GCCTGCTTAA GTACTGCAAC 660

CTCTTCCTGA GGGGCTTTAG GCCCGCCTCT CSlTGAAATCA AGACCCtTCA AAGGTATATG 720

TGTTCCAGGT TTTTCATCGA CTTCTCAGAC ATTGGAGAGC AGCAGAGAAA ACTGGAGTCC 780 

TATTTGCAGA ACCACTTTGT GGGAATTGGA AGACCGCAAG TATGAGTATC TCATGACCCT 840 

TCATGGAGTG GTAAATGAGA GCACAGTGTG CCTGATGGGA CATGAAAGAA GACAGACTTT 900 

AAACCTTATC ACCATGCTGG CTATCCGGGT GTTAGCTGAC CAAAATGTCA TTCCTAATGT 960

GGCTAATGTC ACTTGCTATT ACCAGCCAGC CCCCTATGTA GCAGATGCCA ACTTTAGCAA 1020 

TTACTACATT GCACAGGTTC AGCCAGTATT CACGTGCCAG CAACAGACCT ACTCCACTTG 1080 

GCTACCCTGC AATTAAGAAT CATTTAAAAA TGTCCTGTGG GGAAGCCATT TCAGACAAGA 1140 

CAGGAGAGAA AAAAAAAAAA AAAAAAAAAA A 1171 

(2) INFORMATION FOR SEQ ID NO: 115: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 842 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GGTCTGCGCC GGAAGTGCAT GAGCTGCCGA TGTGGTGCTT AGTGATTGCG GTTTCGGTCG 60 
CTCTCCCGTG TTTCCCGGGC TGGGTATTTG CCTCGCACCA TGGCGCCCAA GGGCAAAGTG 120
GGCACGAGAG GGAAGAAGCA GATATTTGAA GAGAACAGAG AGACTCTGAA GTTCTACCTG 180 
CGGATCATAC TGGGGGCCAA TGCCATTTAC TGCCTTGTGA CGTTGGTCTT CTTTTACTCA 240 

TCTGCCTCAT TTTGGGCCTG GTTGGCCXTG GGCTTTAGTC TGGCAGTGTA TGGGGCCAGC 300 
TACCACTCTA TGAGCTCGAT GGCACGAGCA GCGTTCTCTG AGGATGGGGC CCTGATGGAT 360 
GGTGGCATGG ACCTCAACAT GGAGCAGGGC ATGGCAGAGC ACCTTAAGGA TGTGATCCTA 420
CTGACAGCCA TCGTGCAGGT GCTCAGCTGC TTCTCTCTCT ATGTCTGGTC CTTCTGGCTT 480 
CTGGCTCCAG GCCQGGCCCT TTACCTCCTG TGGGTGAATG TGCTGGGCCC CTGGTTCACT 540 

GCAGACAGTG GCACCCCAGC ACCAGAGCAC AATGAGAAAC GGCAGCGCCG ACAGGAGCGG 600 
CGGCAGATGA AGCGGTTATA GCCATTGACA TTGTGGCCAC AGGCCACTGG CCCTGGGTGG 660 
CTCTGTCAGG GTGCACAGCC CCTCATGCCT GGAGCAATGA GGGTCTAGTC CAGGGGCCAA 720
AAGCAGTCTG AGGTATTGGG TATACTTATA CTCTATAGGG TCGTTGAATA AATGGCTTAG 780 
AATGTGAAAA AAAAAAAAAA AAAAAACTCG AGGQGGGCCC GGTACCCAAT TTCNCCTANA 840 
AT 842



(2) INFORMATION FOR SEQ ID NO: 116: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1640 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

GGCACGAGGC GGCGGCAGCG GTGGCGGCGG CGCCCCCCGG CGGGAGCCGT TCCCTTTCCC 60 

GTCGGGGAGC GCGGGGYCGG GGCCCAGGGG ACCCCGGGCC ACGGAGAGCG GGAAGAGGAT 120 

GGATTGCCCG GCCCTCCCCC CCGGATGGAA GAAGGAGGAA GTGATCCGAA AATCTGGGCT 180

AAGTGCTGGC AAGAGCGATG TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 240 

GCCTCAGTTG GCAAGGTACC TGGGAAATAC TGTTGATCTC AGCAGTTTTG ACTTCAGAAC 300 

TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTCCX3AA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA ATACAACATT GCCAATTAGA CAAACAGCAT 420 

CAATTTTCAA ACAACCGGTA ACCCAAAGTC ACAAATCATC CTAGTAATAA AGTGAAATCA 480 

GACCCACAAC GAATGAATGA ACAGCCACGT CAGCTTTTCT GGGAGAAGAG GCTACAAGGA 540 

CTTTAGTGCA TCAGATGTAA CAGAACAAAT TATAAAAACC ATGGAACTAC CCAAAGGTCT 600 

TCAAGGAGTT GGTCCAGTAG CAATGATGAG ACCCTTTTAT CrGCTGTTGC CAGTGCTTTG 660 

CACACAAGCT CTGCGCCAAT CACAGGGCAA GTCTCCGCTG CTGTGGAAAA GAACXTTGCTG 720 

TTTGGCTTAA CACATCTCAA CCCCTCTGCA AAGCTTTTAT TGTCACAGAT GAAGACTCAG 780 

GAAACAGAAG AGCGAGTACA GCAAGTACGC AAGAAATTGG AAGAAGCACT GATGGCAGAC 840 

ATCTTGTCGC GAGCTGCTGA TACAGAAGAG ATGGATATTG AAATGGACAG TGGAGATGAA 900 

GCCTAAGAAT ATGATCAGGT AACTTTCGAC CGACTTTCCC CAAGAGAAAA TTCCTAGGAA 960 

ATTGAACAAA AATGTTTCCA CTGGCTTTTG CCTGTAAGAA AAAAAATGTA CCCGAGCACA 1020 

TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTTGTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGGAAATG 1200 

TAGCACTTAC GTAAAACATT TGTTTCCCCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATITTTATA 1320 

CTTTATTGAA AQGGGACACC TGTACATTCT TCCATCGTCA CTGTAAAGAC AAATAAATGA 1380 
TTATATTeGA CAGAAAAAAA AAAAAAAAAW MWSTYGARRR GSRGCMCRSW AYMMARWWCC 1440

CCWMRTWRGS MKTCSTMTKA YTTACATTCA ACTCTGATCC CX5GGGCCTTA GGTTTGACAT 1500 

GGGAGGTGGG AGGAAGATAG CGCATATATT TGCAGTATGA ACTATTGCCT CTGGGACGTT 1560 

GTGAGGAATT GTGCTTTCAC CAGAATTTCT AAGGATTTCT GGCTTAAATA TCACCTAGCC 1620 

TGTGGTAATT TTTITTCCCT 1640 



(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
TGAATTTAGN AAACACTTTG GAAAACTCAT AACCTCATCA GAAACTGCCT TTAGCCACAC 60

TCCTGACCTT CTAGATGAGT AACAAAAAAA TGAAATAAGT TCTTGGAAAT TAAGCCATTT 120

ATTTTAATTT GCTATTTTTT TCAATGTTCT AGGTATCTTT AAATTTGTTA TTGTGGAATC 180 

ATTTTCCTGC CAGATACCTT TATCAAAATT ATTGGCCTCA TGAGAGCTGA AGTAAGTCAG 240 

CTTTTTGGTG AACTTTAGTG GACTTCTGTG AGATTGTAGT TGTACTTTGT ATCTCTAAAT 300 

CTAAAGATAG TTTTTTAAAA CTCCCAAAGA AAATCTGCTC TCCITTCTGA TCTAAAAACT 360

CATCTTTGGG GTAAAGAGTT AAGTGTCCAA AGGTTGTCAC AGTTCATGAG GTCAGAGGGA 420 

GCTAGCCTGG CACCTGGACT CTGCCCATCC ACAGCTGACA GATTCCAACA GAAGTGTATT 480 

TAAATTCTCC AGTAGACAAT GCTGGGTAAG GGAGGGGGTA GGGCTGGGTT ATTAAGATAC 540 

AGGCTGCTGT ATTTTACATT GGTTGTGGGG GAAGGGGAGC CTGGAGAAAA CAAAGTCACT 600 

ATTCCCTTTT TTGAAACAGG AAAAAAAATT ATTTTTTGTT CAGTAAAAAT GGTAGAGAAT 660

TCCAATGTCC CTAGCCACAA GGGACCAGTT CCACTGAG/VA GTGAACAGTG GGAACTCAAA 720 

ATTTCAGAAA CATTGGGGGA AGGGAAAATT GGCnTCTCT TAATTGGCAG ATGTTCCAGT 780 

GGGGSGGGGG GGCTCTGTTT TTGTTGGGAT GTGTTATGTT GTATGTACGC ATATATGGAC 840 

CGGAGTCTGC TGAGTTTATA AGGTTCCAAA AATATGGTAA AATCTTGGTT TTTGTTAATT 900 

TATCTCAATA AAAGCCCACT GGRACTCCAA AAAAAAAAAA AAAAAAAAGA NN 952



(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1256 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

GACGTCATAG GTAAACAGGC TCTGTATCCG TGGCAGCGGC CGTGGCAGGC TGGCTGGGTA 60

CCGGCTGTCG CTGACCCAGG AGAAGCTGCC TGTCTACATC AGCCTGGGCT GCAGCGCGCT 120 

GCCGCCGCGG GGCCGGCAGC TGAACTATGT GCTCTTCAGG GCGGGCACCG TGTTGCATTC 180 

ATCTTTGTAC CCCCAGCATC TAGCAGTGTT GGCATGTAGT AGGCACTCAA GAAATGTGTG 240 

TTGAATGAAC GATGCCTGTG ACAAGCAAGC GGACTTTATT CTTTCCTGAC CCTTGCTCCT 300 

ATGACACACC TCCTCCTGAC TGCCACTGTC ACTCCTTCAG AGCAGAACTC CTCTAGGGAA 360

CCTGGATGGG AAACAGCCAT GGCCAAGGAC ATCCTGGGTG AAGCAGGGCT ACACTTTGAT 420 

GAACTGAACA AGCTGAGGGT GTTGGACCCA GAGGTTACCC AGCAGACCAT AGAGCTGAAG 480 

GAAGAGTGCA AAGACTTTGT GGACAAAATT GGCCAGTTTC AGAAAATAGT TGGTOGTTTA 540

ATTGAGCrrG TTGATCAACT TGCAAAAGAA GCAGAAAATG AAAAGATGAA GGCCATCGGT 600

GCTCGGAACT TGCTCAAATC TATAGCAAAG CAGAGAGAAG CTCAACAGCA GCAACTTCAA 660

GCCCTAATAG CAGAAAAGAA AATGCAGCTA GAAAGGTATC GGGTTGAATA TGAAGCTTTG 720 

TGTAAAGTAG AT^GCAGAACA AAATGAATTT ATTGACXIAAT TTATTTTTCA GAAATGAACT 780 

GAAAATTTCG CTTTTATAGT AGGAAGGCAA AACAAAAAAA AGCXTTCTCAA AACCAAAAAA 840 

ACCTCTGTAG CATTCCAGCG GCTTGACCAA TGACCTATGT CACAAGAGGT GGCX5TGTAAG 900 

GAATGCAGCC CCCTGAAGAC AGCACTACAA GTCTGGGGGA GCGAGTTTTA ACATCAGTGC 960

ACAGCTGCTG CTGGTGGCCC TGCAGTGTAC GTTCTCACCT CTTATGCTTA GTTGGAACTA 1020 

AQCAGTTTGT AAACTTTCAT CCTTTTTrTT GTAAATTCAC AAAGCTTTGG AAGGAGAAGC 1080 

AATAAATTTT TGTTTTCAAA TGGCTTGATG TACCTTTTTT CCTGTTGCTC TTGAAATATG 1140 

TTTAACTCCT CATGAGAGAA CCCTGGATTC TCTATCCXrCT AGTCCACAAA ACAAACCAGG 1200 

CAGTQGTCAG CAGCTACCTT TNATTTGGAT CACACACGTG AGTCAGACAG TACCAC 1256



(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1143 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:



GGCCGTAGCA GCCGGGCTGG TCCTGCTGCG AGCCGGCGGC CCGGAGTGGG GCGGCGGCAT 60

GTACCTTCCA CATTGAGTAT TCAGAAAGAA GTGATCTGAA CTCTGACCAT TCTTTATQGA 120 

TACATTAAGT CAAATATAAG AGTCTGACTA CTTGACACAC TGGCTCGAGC AAACATGAAC 180 

GTTGGAGTTG CCCACAGTGA AGTGAATCCA AATACCCGTG TCATGAACAG CCGGGGTATG 240 

TGGCTGACAT ATGCATTGGG AGTTGGCTTG CTTCATATTG TCTTACTCAG CATTCCdTC 300 

TTCAGTGTTC CTGTTGCTTG GACTTTAACA AATATTATAC ATAATCTGGG GATGTACGTA 360

TTTTTGCATG CAGTGAAAGG AACACCTTTC GAAACTCCTG ACCAGGGTAA AGCAAGGCTC 420 

CTAACTCATT GGGAACAACT GGACTATGGA GTACACTTTA CATCTTCACG GAAGTTTTTC 480 

ACAATTTCTC CAATAATTCT ATATTTTCTG GCAAGTTTCT ATACGAAGTA TGATCCAACT 540 

CACTTCATCC TAAACACAGC TTCTCTCCTG AGTGTACTAA TTCCCAAAAT GCCACAACTA 600 

CATGGTGTTC GGATCTTTGG AATTAATAAG TATTGAAATG TTTTGAAACT GAAAAAAAAT 660

TTTACAGCTA CTGAATTTCT TATAAGGAAG GAGTGGTTAG TAAACTGCAC TGTTTCTSTG 720

ATAATGTGAA ATGAGAAGTA TTTACATTGG AGGGCCAATG GCTGGTCCTT CAAGTGCTGT 780 

TrTGAAGTGC AGATTTCCAT TAAATGATGC CTCTGTTTAA TACACCTGGT ACATTTCTGA 840 

AGAGGGGCTT TATAAGCAGG CTGGGCAGGC CXIAGCTTATA AGTTAAAGGG CATCACAGTG 900 

AGGGTGTAGT AGATAAATTC AAGGAAATAA GAGATTTGTA AGAAACTAGG ACCAGCTTAA 960

CTTATAATGA ATGQGCATTG TGTTAAGAAA AGAACATTTC CAGTCATTCA GCTGTGGTTA 1020 

TTTAAAGCAG ACTTACATGT AAACCGGAAT CCTCTCTATA CAAGTTTATT AAAGATTATT 1080 

TTTATTACCG TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAANA 1140 

GAN 1143 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1782 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
CAGGCCCCGG CCCCCCACCC ACGTCTGCGT TGCTGCCCCG CCTGGGCCRG GCCCCAAAGG 60 
CAAGGACAAA GCAGCTGTCA GGGAACCTCC GCCGGAGTCG AATTTACGTG CAGCTGCCGG 120
ci^CCACMG OTCCAAGATC GTITGCGGGG' GCTTCGCGTC 'rTCCAAGA;^:' TCCC^ 180
CCCTCAACCT GCTTTACACC TTGGTTAGTC TGCTGCTAAT TGGAATTGCT GCGTGGGGCA 240 

TTGGCTTCGG GCTGATTTCC AGTCTCCGAG TGGTCGGCGT GGTCATTGCA GTGGGCATCT 300 
TCTTGTTCCT GATTGCTTTA OTGGGTCTGA OTGGAGCTGT AAAACATCAT CAGGTGTTGC 360 
TATTYTTTTA TATGATTATT CTGTTACTTG TATTTATTGT TCAGTTTTCT GTATCTTGCG 420
CTTGTTTAGC CCTGAACCAG GAGCAACAGG GTCAGCrTCT GGAGGTTGGT TGGAACAATA 480 
CGGCAAGTGC TCGAAATGAC ATCCAGAGAA ATCTAAACTG CTGTGGGTTC CGAAGTGTTA 540 

ACCCAAATGA CACCTGTCTG GCTAGCTGTG TTAAAAGTGA CCACTCGTGC TCGCCATGTG 600 
CTCCAATCAT AGGAGAATAT GCTGGAGAGG TTTTGAGATT TGTTGGTGGC ATTQGCCTGT 660 
TCTTCAGTTT TACAGAGATC CTGGGTGTTT GGCTGACCTA CAGATACAGG AACCAGAAAG 720
ACCCCCGCGC RAATCCTAGT GCATTCCTTT GATGAGAAAA CAAGGAAGAT TTCCTTTCGT 780 
ATTATGATCT TGTTCACTTT CTGTAATTTT CTGTTAAGCT CCATTTGCCA GTTTAAGGAA 840 

GGAAACACTA TCTQGAAAAG TACCTTATTG ATAGTGGAAT TATATATTTT TACTCTATGT 900

TTCTCTACAT GTTTTTTTCT TTCCGTTGCT GAAAAATATT TGAAACTTGT GGTCTCTGAA 960 

GCTCGGTGGC ACCTGGGAAT TTACTGTATT CATTGTCGGG CACTGTCCAC TGTGGCCTTT 1020 

CTTAGCATTT TTACCTGCAG AAAAACTTTG TATGGTACCA CTGTGTTGGT TATATGGTGA 1080 

ATCTGAACGT ACATCTCACT GGTATAATTA TATGTAGCAC TGTGCTGTGT AGATAGTTCC 1140 

TACTGGAA?A AGAGTGGRAA TTrATTAAAA TCAGAAAGTA TGAGATCCTG TTATGTTAAG 1200 

GGAAATCCAA ATTCCCAATT TTrTTTGGTC TTTTTAGGAA AGATGTGTTG TGGTAAAAAG 1260 

TGTTAGTATA AAAATGATAA TTWACTKGTA GTCTTTTATG ATWACACCAA TGTATTCTAG 1320 

AAATAGTTAT GYCYTAGGAA ATTC3TGGTTT AATTTTTGAC TTTTACAGGT AAGTGCAAAG 1380 

GAGAAGTGGT TTCATGAAAT GTTCTAATGT ATAATAACAT TTACCTTCAG CCTCCATCAG 1440 

AATGGAACGA GTTTTGAGTA ATCAGGAAGT ATATGTATAT GATCTTGATA TTGOTTTATA 1500 

ATAATTTCAA GTCTA/^AAGA CIX3CATTTTT AAACAAC3TTA GTATTAATGC GTTGGCCCAC 1560 

GTAGCAAAAA GATATTTGAT TATCTTAAAA ATTGTTAAAT ACCX3TTTTCA TGAAAGTTCT 1620 

CAGTATTGTA ACAGCAACTT GTYAAACCTA AGCATATTTG AATATGATCT CCCATAATIT 1680 

GAAATTGAAA TCGTATTGTG TGGCTCTGTA TATTCTGTTA AAAAATTAAA GGACAGAAAC 1740 

CTTTCTTTGT GTATGCATGT TTGAATTAAA AGAAAGTAAT GG 1782 



(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 610 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GTTGGCTGCA GATTTGTGGT GCGTTCTGAG CCGTCTGTCC TGCGCCAAGA TGCTTCAAAG 60 

TATTATTAAA AACATATGGA TCCCCATGAA GCCCTACTAC ACCAAAGTTT ACCAGGAGAT 120 

TTGGATAGGA ATGGGGCTGA TGGGCTTCAT CGTTTATAAA ATCCGGGCTG CTGATAAAAG 180

AAGTAAGGCT TTGAAAGCTT CAGCGCCTGC TCCTGGTCAT CACAACCAGA TTTACTTGGA 240 

GTACATGTGA AAGAAAACGT CAGTCTGCCT GTAAATTTCA GCAAGCCGTG TTAGATGGGG 300 

AGCGTGGAAC GTCACTGTAC ACTTGTATAA GTACCGTTTA CTTCATGGCA TGAATAAATG 360 

GATCTGTGAG ATGCACTGCT ACCTGGTACT GCTTTCAGTG TGTTCCCCCT CAGCCCTCCG 420 

GCGTGTCAGG CATACTCTGA GTAGATAATT TGTCATGCAG CGCATGCAAT CAGAATCTCA 480

CTGAGCCACC CATCATTGTG AAATAATTAC CTCAGTTGTA CAGGACTTGG TGATCAGGAT 540

CCAGGCACTC ACTTGTATTC TACTGCTCAA TAAACGTTTA 1TAAACTTGA AAAAAAAAAA 600 

AAAAAAAAAA 610 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 526 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

GGTACGCCTG CAGGTACCGG TCCGGAATTC CGGGTCGCCC ACGCGTCNGG CCACGCGTCC 60 
ACCCACGCGT CCGSCCACGC GTCGGAGCCG AGCCGGACTG GTCAGGATGA TCACGGACGT 120 
GCAGCTCGCC ATCTTCGCCA ACATC3CTC5GG CGTGTCGCTC TTCTTGCTTG TCGTTCTCTA 180
TCACTACGTG GCCGTCAACA ATCCCAAGAA GCAGGAATGA AAGTGGCGCT TTCTCCGCCC 240 
CAGGGTTCCA GGACATAGTC TGAGGCAAGA TGGAGGGTAT GAGGGGCCTT CACACTTCAC 300 

TTCATCCCTT CTACCCATCA CAACATACAA AGCAACTACA CCTGGATTTT TCCAAACAAC 360 
TTTTATTTCC TCAGAGTCTT CCTTAATCCT ATGGAACAAG AAGCTGCCAC TGAATAGGGC 420 
CCAGTATAGG GGCTTGCTTT TCTACTCCCT CCCCCCAATA TAAAAATATA GACTTTTTAA 480
AAAAAAAAAA AAAAANTTCG NGC3GGGGSCC GGTACCCATC CCCCTA 526

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2081 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

TGTACCGGTC CGGAAATTCC CGGGTCGACC CACGTCGTCS GGGGAACATG GCGGCTKCGG 60 

AGCCGGCGGT CCTTGCGCTC CCCAACAGCG GCGCCGGQGG CGCGGGGGCG CCGTCGGGCA 120 

CAGTCCCGGT GCTCTTCTGT TTCTCAGTCT TCGCGCGACC CTCGTCGGTG CCACACGGGG 180 

CGGGCTACGA GCTGCTCATC CAGAAGTTCC TCAGCCTGTA CGGCGACCAG ATCGACATGC 240 

ACCGCAAATT OGTGGTGCAG CTGTTCGCCG AGGAGTGGGG CCAGTACGTG GACTTGCCCA 300

AGGGCTTCGC GGTRAGCGAG CGCTGCAAGG TGCGCCTCGT GCXTGYTGCAG ATCCAGCTCA 360

CTACCCTGGG AAATCTTACA CCTTCAAGCA CTGTGTTTTT CTGCTGTGAT ATGCAGGAAA 420 

GGTTCAGACC AGCCATCAAG TATTTTGGGG ATATTATTAG CGTGGGACAG AGATTGTTGC 480 

AAGGQGCCCG GATTTTAGGA ATTCCTGTTA TTGTAACAGA ACAATACCCT AAAGGTCTTG 540 

GGAGCACX3GT TCAAGAAATT GATTTAACAG GTGTAAAACT GGTACTTCCA AAGACCAAGT 600 

TTTCAATGGT ATTACCAGAA GTAGAAGCGG CATTAGCAGA GATTCCCGGA GTCAGGAGTG 660 

TTGTATTATT TGGAGTAGAA ACTCATGTGT GCATCCAACA AACTGCCCTG GAGCTAGTTG 720 

GCCGAGGAGT CGAGGTTCAC ATTGTTGCTG ATGCCACCTC ATCAAGAAGC ATGATGGACA 780 

GGATGTTTGC CCTCGAGCGT CTCGCTCRAR CCGGGATCAT AGTGACCACG AGTGAGQCTG 840 

TTCTGCTTCA GCTGGTAGCT GATAAGGACC ATCCAAAATT CAAGGAAATT CAGAATCTAA 900 

TTAAGGCX5AG TGCTCCAGAG TCGGGTCTGC TTTCCAAAGT ATAGGACATT TGAAGAACTG 960 

GTATGCTACT CACTGGTGAA GGACAGTCAG GTGAAGGACT GTAAGCCTAC ACAAGCTCOT 1020 

CTTATCTCTA CTAGAATTAA AATGTTAAGT CAAAAACGGC TCCTTTTTTG CGCCTCCTAG 1080 

TGAAACTTAA CCAGCTAGAC CATTTGAGTA CCAGCATTTA GTTACAAACG TCAAAGGCTT 1140 

CCGGTGCTGC TTACCTTCCT TTTTTGTTAA TGTGCTTTTA TTTATTAAAA AAAATTACAA 1200 

TGAAGATGCC TGTTTTGTCT CTACTGTGTA CTCTGATCGT ATCTTTCCAA AGTGCAGACT 1260 

CTTGTGAAGT TTTCTTAAAT TGTTCACTTT AAAGAAAATG ACGTACCAAC AATGATTTGG 1320 

CTTTTATATT ACTGTAAGAT GTTATAATGT TAATGTGGAT GTAGTGCTTT TACTTTACAG 1380 

ATTGATTGGA ATAAGATTAT TGCATATGAA TTTACCCACA GGACTCTGAA TCATGTTACC 1440 

CACTCCCCTC ACAATGTTGT CCACTTAGTG AGTTGCATTG ATCTATCCX3T ACCAAATGAT 1500 

GTTGAATAAT TACATATCTT TCTTGACTAT ACTGATTTCT TATTTTQGTC ACTATTACTA 1560 

AATCTCTGTT AATATTCTCT CTTTTAACTG AAAAGGGATG GGATAGAAGG GTTTGCAATG 1620 

CCATATTATT GGTGGAGGGC TGTTTTAACA TCTTTGAAGT ATGGCTTGCT GAATATCTTT 1680 

ACCAACATCT TGAATATATA TTCTAGTGTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG 1740 

GTGGAATATC ATTTTAAAAT GTTCATGTTC TGTTCTATAT TTTCTTCACC TACTCTCCAA 1800 

ATATTGTAAT GCAAAAAGTC TCAGTAATGA TTTGGTAGTA TTAATTTTGT GGTCATTGTT 1860 

TCTCTTCGAT AAATTTATIT TCATTAAATA CTTRTTAGAG GGTTTTGAAA TGTTTTTCAA 1920 

ATATGTGAAA TGTGAAACTG CTGTCTTTTA TATTAAAGTA ATTAAAGAAA ATGTATTGTG 1980 

ATTGAAATTA TTTTGNCCTC CACAAGATGG CTCTATGAGT ATTCTTCCAG GGATTCTAAT 2040 

ATTTATTTAA GGTNATAAAA TCTTGACATT TATAATCTTT C 2081 
(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

CCCCGGCGGA GCTGGACCCG CGGTC3GGCTA GGGGCAGGGC CGGAGCCGCG GCGGCGGAGC 60

TGTGGATCCT TCATGATGAG AGATTTGGGG ACACTTCTCT CTCCTGTGTG TAGTTGATAG 120 

TTTGGTGGTG AAGAGATGGC TGACAGTGTC AAAACCTTTC TCCAGGACCT TGCCAGAGGA 180 

ATCAAAGACT CCATCTGGGG TATTTGTACC ATCTCAAAGC TAGATGCTCG AATCCAGCAA 240 

AAGAGAGAGG AGCAGCGTCG AAGAAGGGCA AGTAGTGTCT TGGCACAGAG AAGAGCCCAG 300 

AGTATAGAGC GGAAGCAAGA GAGTGAGCCA CGTATTGTTA GTAGAATTTT CCAGTGTrGT 360

GCTTGGAATG GTGGAGTGTT CTGGTTCAGT CTCCTCPrGT TTTATCGACT ATTTATTCCT 420 

GTGCTTCAGT CGGTAACAGC CCGAATTATC GGTGACCCAT CACTACATGG AGATGTTTGG 480 

TCGTGGCTGG AATTCTTCCT CACGTCAATT TTCAGTGCTC nTGGGTGCT CCCCTTGTTT 540 

GTGCTTAGCA AAGTGGTGAA TGCCATTTGG TTTCAGGATA TAGCTGACCT GGCATTTGAG 600 

GTATCAGGGA GGAAGCCTCA CCCATTCCCT AGTGTCAGCA AAATAATTGC TGACATGCTC 660

TTCAACCTTT TGCTGCAGGC TCTTTTCCTC ATTCAGGGAA TGTTTGTGAG TCTCTTTCCC 720 

ATCCATCTTG TCGGTCAGCT jGGTTAGTCTC CTGCATATGT CCCTTCTCTA CTCACTGTAC 780 

TGCTTTGAAT ATCGTTGGTT CAATAAAGGA ATTGAAATGC ACCAGCGGTT GTCTAACATA 840 

GAAAGGAATT GGCCTTACTA CTTTGGGTTT GGTTTGCCCT TGGCnTTCT CACAGCAATG 900 

CAGTCCTCAT ATATTATCAG TGGCTGCCTT TTCTCTATCC TCTTTCCTIT ATTCATTATC 960

AGCGCCAATG AAGCAAAGAC CCCTGGCAAA GCRTATCTCT TCCAGTTGCG CCTCTTCTCC 1020 

TTGGTGGTCT TCTTAAGCAA CAGACTCTTC CACAAGACAG TCTACCTGCA GTCGGCCCTG 1080 

AGCAGCTCTA CTTCTGCAGA GAAGTTCCCT TCACCGCATC CGTCGCCTGC CAAACTGAAG 1140 

GCTACTGCAG GTCACTGAGT TGCCTGCCAT CCAAAGGGGA TGGGCGGGAT TGGAAGAAGC 1200 

TGTGGCAGCT CTTTTCCCTG TTCACCTCCC GCCTGCCAGG GAAGGCAGGA CCCGCTCTGC 1260

CAAGGGCCCT CTGCGTATTC CCTTCTCTCT GAGGAATTGA AATTTTTGTC TCTGGTGCAC 1320 

GTAAGGCAGA ATGTTCCCTG ACACCAGTGT GTGGATTTTT AACATCACCG TGAGTCTGAA 1380 

AGGACCACAG GTTnTCTGC JWSCTATTTTC TAGCATTTGC CAGTCCCTGT GCCTGGACTG 1440

ATTGGAACAC TTTGTTTTTC TCCCTGTGCC ATTTACCCTT CCACCTTTCC ATCCTGCCTT 1500 

CTACXIACCCr TGGATGAATG GATTTTGTAA TTCTAGCTGT TGTATTTTGT GAATTTGTTA 1560

ATTTTGTTGT TTTTCTGTGA AACACATACA TTGGATATGG GAGGTAAAGG AGTGTCCCAG 1620 

TTGCTCCTGG TCACTCCCTT TATAGCCATT ACTGTCTTGT TTCTTGTAAC TCAGGTTAGG 1680 

TTTTGGTCTC TCTTGCTCCA CTGCAAAAAA AAAAAAA 1717 






(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

CCACGCGTCC GGTCACTATG TAGTGGAGGG GCAGACACCC TCCCGCAAAT TCTGGAAGGT 60 

TCTTAGTCTC GACTAGGGCA GTAGCCCCAG GACTCCTAGT CGCCGGCTTC AGGTCACTGC 120 

CGGCTGAACG GAGCTGCCGT CGCCATGTTT GGCTGCTTGG TGGCGGGGAG GCTGGTGCAA 180

ACAGCTGCAC AGCAAGTGGC AGAGGATAAA TTTGTTTTTG ACTTACCTGA TTATGAAAGT 240 

ATCAACCATG TTGTGGTITT TATGCTGGGA ACAATCCCAT TTCCTGAGGG AATGGGAGGA 300 

TCTGTCTACT TTTCTTATCC TGATTCAAAT GGAATGCCAG TATGGCAACT CCTAGGATTT 360 

GTCACGAATG GGAAGCCAAG TGCCATCTTC AAAATTTCAG GTCTTAAATC TGGAGAAGGA 420 

AGCCAACATC CTTTTGGAGC CATGAATATT GTCCGAACTC CATCTGTTGC TCAGATTGGA 480

ATTTCAGTGG AATTATTAGA CAGTATGGCT CAGCAGACTC CTGTAGGTAA TGCTGCTGTA 540 

TCCTCAGTTG ACTCATTCAC TCAGTTCACA CAAAAGATGT TGGACAATTT CTACAATTTT 600 

GCTTCATCAT TTGCTGTCTC TCAQGCXX:AG ATGACACCAA GCCCATCTGA AATGTTCATT 660 

CCGGCAAATG TGGTTCTGAA ATGGTATGAA AACTTTCAAA GACGACTAGC ACAGAACCCT 720 

NTNTTTTGGN AAACATAATT TGAATAAAAT AATTTTTAAT GGATTNTGNA AAAAAAAAAA 780

AAAAAAAAAA AAAAAAAAAA AAAA 804 



(2) INFORMATION FOR SEQ ID NO: 126: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 431 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

GGCACAGCCC AGGGCCTTGA AGCCAGCTGG CCCTQGAGAG GGGCTCCTCT GCCAGCITGG 60 

GGAGGGTCTG GGATGGGGCT GCCCCTGATG GCCCTGATGT GGAGTACCTT GCCAGCATCT 120 

GCTGGGGTGA ACTTTATTTT AGCCCTTCCC TTGTTGCTCT TATGGAAGAA CAGAGGAGGG 180 

GTGGGCAGGT CAGTGATGTC AGCAGTGGAG TGATTCCCAG CACAGCGGCT TCTQGGAAGA 240 

GGGCATGGAG GCATTTCTTT CAGGGAAATG GTCCATNATT TCAGCCAGAA GGCATTGCAT 300

TAAGTTAAGT CCNGGACTTT TGTGGCCCAG CTCTGTGTTA TTAAGGGCCC TTGGCGAAGA 360 

CTTCAAGGAG GGGGCAAAAN GACCTTTAAG TTTTTAGGTT TAACACAGGG AACCCNCAAA 420 

GGGTTATTTT G 431 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

NGGCACGAGG AGAGTCACCT GGACTCAGAA CTAGAGATM 60 

AAACTTCAGC TTTCTAAGCA TAAQGAGTTT CAGAAGACTC TTGGTGGCAA GCAGCCTGTG 120 

TATGATACCA CAATTAGAAC TQGCAGAGCA CTGAAAGAAA AGACTTTGCT TCCCGAAGAT 180

ASTCAGAAAC TTGACAATTT CCTAGGAGAA GTCAGAGACA AATGGGATAC TGTTTGTGGC 240 

AAGTCTGTGG AGCGGCAGCA CAAGTTGGAG GAAGCCCTGC TCTTTTCGGG TCAGTTCATG 300 

GATGCTTTGC AGGCATTGGT TGACTGGTTA TACAAGGTGG AGCCACAGCT GGCTGAGGAC 360 

CAGCCCGTGC ACGGGGGACC TTGACCTCGT CATGAACCTC ATGGATGCAC ACAAGGTTTT 420 

CCAGAAGGAA CTGGNGAAAG CGAACAGGAA CCGTTCAGGT CCTGAAGCGG TCAGGCCGAG 480

AGCTGATTGA GAATAGTCGA GATGACACCA CTTGGGTAAA AGGACAGCTC CAGGAACTGA 540 

GCACTCGCTG GGACACTGTC TGTAAACTCT CTGTTTCCAA ACAAAGCCGG CTTGAGCAGG 600 

CCTTAAAACA AGCQGAAGTG TTTCGAGACA CAGTCCACAT GCTGTTGGAG TGGCTTTCTG 660 

AAGCAGAGCA AACGCTTCGC TTTCGGGGAG CACTTCCTGG ATGACACAGA GGCCCTGCAG 720 

TCTCTCATTG ACACCCATAA GGAATTCATG AAGAAAGTAG AAGAAAAGCG AGTGGACGTT 780

AACTCAGCAG TAGCCATGGG AGAAGTCATC CTGGCTGTCT GCCACCCCGA TTGCATCACA 840

ACCATCAAAC ACTGGATCAC CATCATCCGA GCTCGCTTCG AGGAGGTCCT GACATGGGCT 900 

AAGCAGCACC AGCAGCGTCT TGAAACGGCC TTGTCAGAAC TGGTGGCTAA TGCTGAGCTC 960 

CTGGAAGAAC TTCTGGCATG GATCCAGTGG GCTGAGACCA CCCTCATTCA GCGGGATCAG 1020 

GAGCCAATCC CGCAGAACAT TGACCGAGTT AAAGCCCTTA TCGCTGAGCA TCAGACATTT 1080

ATGGAGGAGA TGACTCGCAA ACAGCCTGAC GTGGACCGGG TCACCAAGAC ATACAAAAGG 1140 

AAAAACATAG AGCCTACTCA CGCGCCTTTC ATAGAGAAAT CCCGCAGCQG AGGCAGGAAA 1200 

TCCCTAAGTC AGCCAACCCC TCCTCCCATG CCAATCXriTT CT^CAGTCTGA AGCAAAAAAC 1260 

CCACGGATCA ACXAGCTTTC TGCCCGCTGG CAGCAGGTGT QGCTGTTAGC ACTGGAGCGG 1320 

CAAAGGAAAC TGAATGATGC CTTGGATCGG CTGGAGGAGT TGAAAGAATT TGCCAACTTT 1380

GACTTTGATG TCTGGAGGAA AAAGTATATG CGTTGGATGA ATCACAAAAA GTCTCGAGTC 1440 

ATGGATTTCT TCCGGCGCAT TGATAAGGAC CAGGATGGGA AGATAACACG TCAGGAGTTT 1500 

ATCGATGGCA TTTTAGCATC CAAGTTCCCC ACCACCAAGT TAGAGATGAC TGCTGTGGCT 1560 

GACATTTTCG ACCGAGATGG GGATGGTOAC ATTGATTATT ATGAATTTGT GGCTGCTCTT 1620 

CATCCCAACA AGGATGCGTA TCGACCAACA ACCGATGCAG ATAAAATCGA AGATGAGGTT 1680

ACAAGACAAG TGGCTCAGTG CAAATGTGCA AAAAGGTTTC AGGTGGAGCA GATCGGAGAG 1740 

AATAAATACC GGTTCTTCCT CGGCAATCAG TTTGGGGATT CTCAGCAGTT GCX3GCTGGTC 1800 

CGTATTCTGC GCAACCGTGA TGGTTCGCGT TGGTGGAGGA TGGATGGCCT TGGATGAATT 1860

TTTAGTGAAA AATGATCCCT GCCGAGCACG AGGTAGAACT AACATTGAAC TTAGAGAGAA 1920 

ATTCATCCTA CCAGAGGGAG CATCCCAGGG AATGACCCCC TTCCGCTCAC GGGGTCGAAG 1980

GTCCAAACCA TCTTCCCX3GG CAGCTTCCCC TACTCGTTCC AGCTCCAGTG CTAGTCAGAG 2040 

TAACCACAGC TGTACATCCA TGCCATCTTC TCCAGCCACC CX::AGCC:AGTG GAACCAAGGT 2100 

TATCCCATCA TCAGGTAGCA AGTTGAAACG ACCAACACCA ACTTTTCATT CTAGTCGGAC 2160 

ATCCCTTGCT GGTGATACCA GCAATNAGTT CTTCCXXX3GC CTCCACAGGT GCCAAAACTA 2220 

ATCGGGCAGA CCCTAAAAAG TCTGCCAGTC GCCXrTGGGAG TCGGGCTGQG AGTCGAGCCG 2280

GGAGTCGAGC CAGCAGCCGG CGAGGAAGTG ACGCTTCTGA CTTTGACCTC TTAGAGACGC 2340 

ATTGCTTGTT CCGACACTTC AGAAAGCAGC GCTGCAGGGG GCCAAGGCAA CTCCAGGAGA 2400 

GGGCTAAACA AACCTTCCAA AATCCCAACC ATGTCTAAGA AGACCACCAC TGCCTCCCCC 2460 

AGGACTCCAG GTCCCAAGCG ATAACACTGT CTAAGCACCC CCAAGCCACT ATCCACTTTG 2520 

AATCCTGCTC CATACATTGG GTGTATATTT ATTCTGAACG GGAGAAGTTA TATTGTTAAA 2580

AGTGTAAAAG AATAATTGTG TTATGAAGCT GCCTTATTTT TOTTCTTTTT GTAAGTTACT 2640

ATTTTCATGT GAATATTTAT GTAGATAAAA TTTQCCTCCT GGTAACCCTG TAATGGATGG 2700 

GGCCCAGAAA TGAAATATTT GAC3AAAAACA AGTGAAAAGG TCAAGATACA AATGTGTATT 2760 

AAAAAAAAAA AAGCCTATTA ATAGGGTTTC TGCGCGGTGC AGGCSTrGTAA ACCTGCTTTA 2820 

TCTTTTAGGA TTATTCCTAA ATGCATCTTC TTTATAAACT TGACTTGCTA TCTCAGCAAG 2880

ATAAATTATA TTAAAAAAAT AAGAATCCTG CAGTGTTTAA GGAACTCTTT TTTTGTAAAT 2940 

CACGGACACC TCAATTAGCA AGAACTGAGG GGAGGGCTTT TTCCATTGTT TAATGTTTTG 3000 

TGATTTTTAG CTAAAGAGAG GGAACCTCAT CTAAGTAACA TTTGCACATG ATACAGCAAA 3060 

AGGAGTTCAT TGCAATACTG TCTTrGGATA TTGTTTCAGT ACTGGGTGTT TAAAGGACAA 3120 

ATAGCTGCTA GAATTCAGGG GTAAATGTAA GTGTTCAGAA AACGTCAGAA CATTTGGGGT 3180

TTTAAACTGA TTTGTTGCTC CCTATCCAGC CTAGACACCA GTAACTCTTG TGTTCACCAG 3240 

GACCCAGACC CTTGGCAAGG GATAGGCTCG TTGGTGACAT TGTGAATTTC AGATTTGTTT 3300 

TATCCACTTT TTTTGCTATT TATTTAAATG GTCGATCAAC TTCCCACAAA CTGAGGAATG 3360 

AATTCCACGA GCCTGTTCTG AAAATGTGGA CGTAAGACAA ACACX3TGCTC GTCCTTTAAT 3420 

GGAGTTCACC AGCACACTTG TTAACCAGTC CTGTTTGCTT TCGTCTTTTT TTGTGCX3TAA 3480

TAAAGTCAAC TGACCAAGTG ACCATGAAAA GGGGCTGTCT GGGGCTCCTG TTTTTTAGCT 3540 

GCTGTTCTTC AGCTCCGACC ATGTTGCTGT GTGATTATCT CAATTGGTTT TAATTGAGGC 3600 

AGAAACTGAA GCTCTACCAA TeAACaX3TTT AGAAACAAGA CACACTTTTG TATTAAAATT 3660

GCTTGCAGTA ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGAGG GGGGCCCX3GT 3720 

ACCCAATTCG CCGTATATGA TCGTAAACAA TC 3752



(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
TGACCCTCTG CCTGCCQGGC TCAGTGCTGG ACGCTTTCTG TTTTCTCGCA GTCGGTCCTC 60
GGTAACACCA GCGGCCTGTG GTCCACCACT CCATTCAGCA GCTCCATTTG GTCCAGCAAC 120 
CTTAGCAGCG CCTTCCCTTC ACCACTCCAG CAAACACGCT GGCAAGCATC GGCCTCATGG 180 

GCACAGAAAA CTCCCCTGCT CCTCACGCTC CCTCCACCTC CAGTCCAGCT GACGACTTGG 240

GACAGACCTA CAACCCGTGG CGGATATGGA GCCCCACGAT TGGAAGAAGA AGCTCGGACC 300 

CrrGGTCTAA TTCGCACTTT CCTCACGAGA ATTAAATTAA GCAAAAAACA AACAAACATA 360

GTGGGCCCTC GTCTAGATCA TGATGTGCCA GTTTCTGAGA CATCTTTTTA AGGCTCTTAC 420 

TGCAGCTCCC CTCCCCACCC TCCTCTTCTT TGCAAAACAG ACCCAAGCAG GGCAGGCTCA 480 

GACCACTCGC TTCTTTCAGA TCTTTCTTGC AATTATGATA ACATGAGATT TGCTGTTGTG 540 

CTTTTAGAGA AAAGTCTGGA CTCAGCCACA AACTCTAATA AGACCTGTAC ATCTGAGAAC 600 

CITTCCCGTT ACTGCGTCTT CACCACCTGT CTTCCCCATG CTTTATTTAT CTGTATGAAC 660

ACAGATTTGA CATTACAGCT AAGGAAATAA TTTGAGTTGA TTCAGAAATC CTGGCATGTG 720 

ACAATTTTGT TAAATTACCA AGTTTGGTTT TTAATAATTT CTCAATATTA TGCGCCAAGA 780 

TCTAAITTTA AAACTGTATG AGGACTTTGT GCTGAAAATA GAGTATTTTT TTAAAGTAAG 840 

GCTGTCTTGG TTTAAAAGCA GATTACAGAA ATGTAAGTCA ACTTAAGAAC RGTGAATGAA 900 

TGTAAAAACA TTCAGTYGAG ACCATATGCA TTTTCTGTGC TGTTTGTACT TGAGGTATGT 960

AACATTTGTA TACCTGAACT TATTTTAAAG ATGAACTGAA ATGCACATAG CCAAGTCTTG 1020 

AGATACAAGA TTGAATGTGT ATTTCTTAAA AATACAACTT TGTGTTGTAC TTTGAAATAA 1080 

ATGATGCTTT TTTCAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC 1140 

CAAT 1144 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 



GCATGCAGAG GAGCACCCTG AGCGTGTYCC TGGAGCAGGC GGCCATSTTG GCACGGAGCC 60 

ACX5GGTTGCT GCCCAAGTGC ATCATGCAGG CCACGGACAT CATGCGGAAC AGGGCCCAAG 120

GGTGGAGATT CTGGCCAAAA ACCTGCGAGT CAAGGACCAG ATGCCCCAGG GTGCTCCGCG 180 

CCTCTACCGC CTCTGCCAGC CGCCGGTGGA TGGGGACCTC TGAACACCCA AATGCCCCAC 240 

GCTGGGCCGC GGCCTCTGGA GCTGGGATTT GGGAGGACAC AGCAGGCAGC GCTGGCCTTC 300 

TCCAGGGATG GCCCAANGCT TCCGCARCCG CCCGTTCCGG GACCTGCCCA GCGTCCTCCC 360 

TGCCTCCTTC CGGGACAAGC CTGGCCACCC TCGCTGTGAT GACGAGCTGG CTGATTGGCC 420



WO 98/39448

PCT/US98/04493



CTGGGCCGGC CCATTCTTCA CACGCCTGCC AGAAGCTGGA GGGGTGCTGG AGACCCATAG 480 

AGCTGATGGG AGCAGCTGGT GCXrTGGCCTT CX3GCTCCTGC GTCCCCAGAA CCCAAGGGAA 540 

CGTCATGGAG GCCACATGGG GCCACCCX3GC TCCCTCGGGA TGGCTCCGCT GCACTTTTGA 600 

AACCCCGGTT TCCTITCAACG TCCACATTCC AGGTGACCAC ACGTSTCTCC TCCICCTCAT 660 

CTTAGCTTCC AGGTTCACCC TAACXrCTGTA CTAACCTGCT TGGTGGACTT GGAAAAGACT 720

TGGCTCTGTC GGGAAAC3GAG AGACQGGGCC TCCATCACGC CTGTTACCAG AGGATCCCCG 780 

AGAGCCACAC CAGCTCTGGA CATCACCGCC CCTQGAACTG GGGCCACCAG CXTCTCSGGCAC 840 

GAGATTTGCT CTGACTTTAT TTATATQGCA TGAAATCTCT GGTTTATTTT GGGATTmT 900 

GTTGTTGGTG TTGTCAAAGT TTGTTTTTTC TAAASTTGTG TGATTATATA TTTGACATTT 960 

TACATTTCAA AGAAAGGTAT GTTGTCTAAC AGGGGACCAA CAGAAGGTAG TATTGACAAC 1020

TGTTCCTGCT TCTACTAAAA AAAAAAGAGC ACAAAAGAAA AACTAAATTA TTGAAAAATT 1080 

AAAAAATGTC ATTGTTTCCT GTTTGTTAAT ATTAGGGTTG TAAGGTGTCG TTTTGAGGTA 1140 

TCGACTGTGA TTCCTTCCCC CACCCTCCAT TCTCCAGCGG TTGGCCX3GTG TTAGAACTCG 1200 

CrCTCTTTGA GTGACTGGCT ACAAGGGCCT GAGAGGTGGC CAGCCAGGGT TGGAGCTGGA 1260 

GGGGATGGAG CCCCACCTGA GGTGCCGTGT CACACGGGTT AGAGGGTCAC TGGGAAACAC 1320

CGGGCGGTCG CTTCTGTGAT TTATTTTCTT GATGGTAACT TCTCAGAGCA GGGCRATTGG 1380 

GACATCACCA GCCAGAGCAC AGGAAGCCAC CCTGCCTGCT GGGGAGGAGG GACCCACACA 1440 

ACeCCeCTCG GCAGTTTGTC CCCCCAGCTT CGGTATGCCT TCAGGGAAAG GTCACAGCTG 1500

GGGAGGAAGC GGGGGGACGC CTGTCACCCC TGGCAGGTGG TGAGTTCAGG TGGGGGCTCC 1560 

CTGCTKCCCC CAGGCCTGGG AGCTTGAAGC CCTCCCGGCA TCTGGCATCC GAGCCTCCCG 1620

CCCTCCAGGG TGCGCTTCCC TCTCTTGCCG CAGCATACAC GAGGGCAGGC AGTGGCCTTG 1680 

TCACTGTATC TTGCATCAGA GACAAAGGAG GACCCGCTTT AGCCCTGCTG CGGGAAATGG 1740 

GGGATGGCCC AGGGCCAGCG CATTGTGCAC TGGTTTACTT TAAAATGTAC AGATTCTTCT 1800 

CGTTAAATTC TTGATAGATT TTTTATTATT 1830 



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

GGCXTGCCCGG ATGGCGACCC CAGCXTTCGGC CCCAGACACA CGGGCTCTGG TGGCAGACTT 60 

TGTAGGTTAT AAGCTGAGGC AGAAGGGTTA TGTCTGTGGA GCTGGCCCCG GGGAGGGCCC 120 

AGCAGCTGAC CCGCTGCACC AAGCCATGCG GGCAGCKGGA GATGAGTTCG AGACCCGCTT 180 

CCGGCX3CACC TTCTCTGATC TGGCGGCTCA GCTGCATGTG ACCCCAGGCT CAGCCCAACA 240 

ACGCTTCACC CAGGTCTCCG ATGAACTTTT TCAAGGGGGC CCCAACTGGG GCCGCCTTGT 300 

AGarrrciTT GTCITTGGGG CTGCACTGTG TGCTGAGAGT GTCAACAAGG AGATGGAACC 360 

ACTGGTGGGA CAAGTGCAGG AGTGGATGGT GGCCTACCTG GAGACGCGGC TGGCTGACTG 420 

GATCCACAGC AGTGGGGGCT GGTTATCCCA GATCACTGAA GCTGAGATGG CTGATGAAGT 480 

AATTTGCAGT GAAATTTTAA GCGACTGTGA CTCTGCTGCA AGTTCCCCAG ATCTTGAGGA 540 

GCTGGAAGCT ATCAAAGCTC GAGTCAGGGA GATGGAGGAA GAAGCTGAGA AGCTAAAGGA 600 

GCTACAGAAC GAGGTAGAGA AGCAGATGAA TATGAGTCCA CCTCCAGGCA ATGCTGGCCC 660

GGTGATCATG TCCATTGAGG AGAAGATGGA GGCTGATGCC CGTTCCATCT ATGTTGGCAA 720 

TGTGGACTAT GGTGCAACAG CAGAAGAGCT GGAAGCTCAC TTTCATGGCT GTGGTTCAGT 780 

CAACCGTGTT ACCATACTGT GTGACAAATT TAGTGGCCAT CCCAAAGGGT TTGCX3TATAT 840 

AGAGTTCTCA GACAAAGAGT CAGTGAGGAC TTCCTTGGCC TTAGATGAGT CXTCTATTTAG 900 

AGGAAGGCAA ATCAAGGTGA TCCCAAAACG AACCAACAGA CCAGGCATCA GCACAACAGA 960 

CCGGGGOTTT CCACGAGCCC GCTACCGCGC CCGGACCACC AACTACAACA GCTCCCGCTC 1020 

TCGATTCTAC AGTGGTTTTAr-AeAGCAGGee-^CSGGGTCGe GTCTAeftGGG-^eGGGCTAG 1080

AGCGACATCA TGGTATTCCC C^ACTAAAA AAAGTGTGTA TTAGGAGGAG AGAGAGGAAA 1140 

AAAAGAGGAA AGAAGGAAAA AAAAAAGAAT TAAAAAAAAA AAAAAAAAAA ACAGAAGOTG 1200 

MCCTTGATGG AAAAAAAATA TTTTTTAAAA AAAAGATATA CTGTGGAAGG QGGGAGAATC 1260 

CCATAACTAA CTGCTGAGGA GGGACCTGCT TTGGGGAGTA GGGGAAGGCC CAGGGARTGG 1320 

GGCAGGGGGC TGCITATTCA CTCTGGGGAT TCGCCATGGA CACGTCTCAA CTGCGCAACT 1380 

GCTTGCCCAT GITTCCCTGC CCCACCCCAC CCCTCTTCTC CGGCTCCCTG CCCCTCCAGA 1440 

TTGCCTGGTG ATCTATTTTG TTTCCTTTTG TGTTTCTTTT TCTGTTTTGA GTGTCTTTCT 1500 

TTGCAGGTTT CTGTAGCCGG AAGATCTCCX? TTCCX3CTCCC AGCGGCTCCA GTGTAAATTC 1560 

CCCTTCCCCC TGGGGTIlAATG CACTACXnTG TTTTGGGGGG TTTAGGGGTG TTTTTGTTTT 1620 

TCAGTTGTTT TGTTTTTTTG 'i'l'I'lTri' M TT TTTCCTTTGC CTTTTTTCCC TTTTATTTGG 1680 

AGGGAATGGG AGGAAGTGGG AACAGGGAGG TGGGAGGTGG ATTTTGTTTA TTTTTTTAGC 1740 
TCATTTCCAG GGGTGGGAAT TTTTTTrTAA TATGTGTCAT GAATAAAGTT GTTTTTGAAA 1800
AKAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1860 
AAAA 1864 




(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2041 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
GGCACGAGCG CGCGGCAGGG CCCTGGACCC GCGCGGCTCC CGGGGATGGT GAGCAAGGCG 60 

CTGCTGCGCC TCGTGTCTGC CGTCAACCGC AGGAGGATGA AGCTGCTGCT QGGCATCGCC 120 

TTQCTGGCCT ACGTCGCCTC TGTTTGGGGC AACTTCGTTA ATATGAGGTC TATCCAGGAA 180 

AATGGTGAAC TAAAAATTGA AAGCAAGATT GAAGAGATGG TTGAACCACT AAGAGAGAAA 240 

ATGAGAGATT TAGAAAAAAG CTTTACCCAG AAATACCCAC CAGTAAAGTT TTTATCAGAA 300 

AAGGATCGGA AAAGAATTTT GATAACAGGA GGCGCAGGGT TCGTGGGCTC CCATCTAACT 360 

GACAAACTCA TGATGGACGG CCACGAGGTG ACCGTGGTGG ACAATTTCTT CACGGGCAGG 420 

AAGAGAAACG TGGAGCACTG GATCGGACAT GAGAACTTCG AGTTGATTAA CCACGAOGTG 480 

TGGAGCCCCT CTACATCGAG GTTGACCAGA ATACCATCI^ J3GCATCTCCA GCCTCC^ 540

CAAACTACAT GTATAATCCT ATCAAGACAT TAAAGACCAA TACGATTGGG ACATTAAACA 600 

TGTTGGGGCT GGCAAAACGA GTCGGTGCCC GTCTGCTCCT GGCCTCCACA TCGGAGGTGT 660 

ATGGAGATCC TGAAGTCCAC CCTCAAAGTG AGGATTACTG GGGCCACGTG AATCCAATAG 720 

GACCTCGGGC CTGCTACGAT GAAGGCAAAC GTGTTGCAGA GACCATGTGC TATGCCTACA 780 

TGAAGCAGGA AGGCGTGGAA GTGCGAGTGG CCAGAATCTT CAACACCTTT GGGCCACGCA 840 

TGCACATGAA CGATGGGCGA GTAGTCAGCA ACTTCATCCT GCAGGCGCTC CAGGGGGAGC 900 

CACTCACGGT ATACGGATCC GQGTCTCAGA CAAGGGCGTT CCAGTACGTC AGCGATCTAG 960 

TGAATGGCCT CGTQGCTCTC ATGAACAGCA ACGTCAGCAG CCCGGTCAAC CTGGGGAACC 1020 

CAGAAGAACA CACAATCCTA GAATTTGCTC AGTTAATTAA AAACCTTGTT GGTAGCGGAA 1080 

GTGAAATTCA GTTTCTCTCC GAAGCCCAGG ATGACCCACA GAAAAGAAAA CCAGACATCA 1140 

AAAAAGCAAA GCTGATGCTG GGGTGGGAGC CCGTGGTCCC GCTGGAGGAA GGTTTAAACA 1200 

AAGCAATTCA CTACTTCCGT AAAGAACTCG AGTACCAGGC AAATAATCAG TACATCCCCA 1260 
AACCAAAGCC TGCCAGAATA AAGAAAGGAC GGACTCGCCA CAGCTGAACT CCTCACTTTT 1320

AGGACACAAG ACTACCATTG TACACTTGAT GGGATGTATT TTTGGCTTTT ITTTGTTGTC 1380

GTTTAAAGAA AGACITTAAC AGGTGTCATG AAGAACAAAC TGGAATTTCA TTCTGAAGCT 1440

TGCTTTAATG AAATGGATGT GCCTAAAAGC TCCCCTCAAA AAACTGCAGA TTTTGCCTTG 1500

CACTTTTTGA ATCTCTCTTT TTATGTAAAA TAGCGTAGAT GCATCTCTGC GTATTTTCAA 1560

GTTTTTTTAT CTTGCTGTGA GAGCATATGT TGTGACTGTC GTTGACAGTT TTATTTACTG 1620

GTTTCTTTGT GAAGCTGAAA AGGAACATTA AGCGGGACAA AAAATGCXX3A TTTTATTTAT 1680

AAAAGTGGGT ACTTAATAAA TGAGTCGTTA TACTATGCAT AAAGAAAAAT CXTTAGCAGTA 1740

TTGTCAGGTG GTGGTGCGCC GGCATTGATT TTAGGGCAGA TAAAAGAATT CTGTGTGAGA 1800

GCTTTATGTT TCTCrnTAA TTCAGAGTTT TTCCAAGGTC TACTTTTGAG TTGCAAACTT 1860

GACTTTGAAA TATTCCTGTT GGTCATGATC AAGGATATTT GAAATCACTA CTGTGTTTTG 1920

CTGCGTATCT GGGGCGGGGG CAGGTTGGGG GGCACAAAGT TAACATATTC TTGGTTAACC 1980

ATGGTTAAAT ATGCTATTTT AATAAAATAT TGAAACTCAC CAAAAAAT^ AAAAAAAAAA 2040

A 2041

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2012 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

TACCAAGCTG CAAGAATCTA CTATATCATG GCAGAAQAAG TAGAGTGGGA CTATTGCCCT 60 

GACCGGAGCT GGGAACGGGA ATGGCACAAC CAGTCTGAGA AGGACAGTTA TGGTTACATT 120 

TTCCTGAGCA ACAAGGATGG GCTCCTGGGT TCCAGATACA AGAAAGCTGT ATTCAGGGAA 180 

TACACTGATG GTACATTCAG GOTCCCTCGG CCAAGGACTG GACCAGAAGA ACACTTGGGA 240 

ATCTTGGGTC CACTTATCAA AGGTGAAGTT GGTGATATCC TGACTGTGGT ATTCAAGAAT 300 

AATGCCAGCC GCCCCTACTC TGTGCATGCT CATGGAGTGC TAGAATCTAC TACTGTCTGG 360 

CCACTGGCTG CTGAGCCTGG TGAGGTGGTC ACTTATCAGT GGAACATCCC AGAGAGGTCT 420 

GGCCCTQGGC CAATGACTCT GCTTGTGTTT CCTGGATCTA TTATTCTGCA GTGGATCCCA 480 

TCAAGGACAT GTATAGTGGC CTGGTGGGGC CCTTGGCTAT CTGCCAAAAG GGCATCCTGG 540 
NAGCCCCATG GAGGACGGAN TGACATGGAT CGGGAATTTG CATTGTTGTT CTTGATTTTT 600

GATGAAAATA AGTCTTGGTA TTTGGAGGAA AATGTGGCAA CCCATGGGTC CCAGGATCCA 660 

GGCAGTATTA ACCTACAGGA TGAAACTTTC TTGGAGAGCA ATAAAATGCA TGCAATCAAT 720

GGGAAACTCT ATGCCAACCT TAGGGGTCTT ACCATGTACC AAGGAGAAOG AC3TGGCCTQG 780 

TACATGCTGG CCATGGGCCA AGATGTGGAT CTACACACCA TCCACTTTCA TGCAGAGAGC 840 

TTCCTCTATC GGAATGGCGA GAACTACCGG GCAGATGTGG TGGATCTOTT CCCAGGGACT 900 

TTTGAGGTTG TGGAGATGGT GGCCAGCAAC CCTGGGACAT GGCTGATGCA CTGCCATGTG 960 

ACTGACCATG TCCATGCTGG CATQGAGACC CTCTTCACTG OTTTTTCTCG AACAGAACAC 1020

TTAAGCCCTC TCACCGTCAT CACCAAAGAG ACTGAAAAAG CAGTGCCCCC CAGAGACATT 1080 

GAAGAAGGCA ATGTGAAGAT GCTQGGCATG CAGATCCCCA TAAAGAATGT TGAGATGCTG 1140 

GCCTCTGTTT TGGTTGCCAT TAGTGTCACC CTTCTGCTCG TTGTTCTGGC TCTTGGTGGA 1200 

GTGGTTTGGT ACCAACATCG ACAGAGAAAG CTACGACGCA ATAGGAGGTC CATCCTGGAT 1260 

GACAGCTTCA AGCTTCTGTC TTTCAAACAG TAACATCTGG AGCCTGGAGA TATCCTCAGG 1320

AAGCACATCT GTAGTGCACT CCCAGCAGGC CATGGACTAG TCACTAACCC CACACTCAAA 1380 

GGGGCATGGG TGGTGGAGAA GCAGAAGGAG CAATCAAGCT TATCTGGATA TTTCTTTCTT 1440 

TATTTArrTT ACATGGAAAT AATATGATTT CACTTTTTCT TTAGTTTCTT TGCTCTACGT 1500 

GGGCACCTGG CACTAAGGGA GTACCTTATT ATCCTACATC GCAAATTTCA ACAGCTACAT 1560 

TATATTTCCT TCTGACACTT GGAAGGTATT GAAATTTCTA GAAATGTATC CTTCTCACAA 1620

AGTAGAGACC AAGAGAAAAA CTCATTGATT GGGTTTCTAC TTCTTTCAAG GACTCAGGAA 1680 

ATTTCACTTT GAACTGAGGC CAAGTGAGCT GTTAAGATAA CCCACACTTA AACTAAAGGC 1740 

TAAGAATATA GGCTTGATGG GAAATTGAAG GTAGGCTGAG TATTGGGAAT CCAAATTGAA 1800 

TTTTGATTCT CCTTGGCAGT GAACTACTTT GAAGAAGTGG TCAATGGGTT GTTGCTGCCA 1860 

TGAGCATGTA CAACCTCTGG AGCTAGAAGC TCCTCAGGAA AGCCAGTTCT CCAAGTTCTT 1920

AACCTGTGGC ACTGAAAGGA ATGTTGAGTT ACCTCTTCAT GTTTTAGACA GCAAACCCTA 1980 

TCCATTAAAG TACTTGTTAG AACACTGAAA AA 2012 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

GAGCAGTATT TTAACCAACT TGTATTACAG ATGTTACAGT TCATGTTAGG AAGTCAGAAA 60 

AGACTTTGTT TGTCTTTGTT CTGCTGATGT GAGTCATGTT TTGTGGGGTC TTCCATGGCA 120 

CATTTACCTG TTGCTCCGTC CAGATGTTGA GGGCCAGTCT AGGCTGACAC ATCCTACCCG 180 

AGGACAAGCC TGTTCTCCAT TTCTTCACTC TCCCCTCCCC ATATAGCAAC TCTCCCAGGT 240

TTAGATTACC GTITTCGACG ACAGATTAAC CAAAAATGCC CCACACAGGT TTTATTACTG 300 

TTATATACTA TACTTTTAAC AGTACAGACC CTAAATTTTA TrATTTGTTG CTCCCCCAAT 360 

CTGATACCAA ATGTTTAAAG TTGTTTGAAA TCCAAACATG GTAGTGTTCA TGGGTAAATA 420 

TTTTCTAGGC TATGTAAGAG TTAGCAGCCC ATAGCATAGA AGTAATCAAG TAGCATCTGA 480 

GACTGTTGGA GGCACTAGGG CCTCTCTGGG CCTAACAGCC TCACTTCCCC AGCCTCACCT 540

TGCTGTCCTC TGACACTGCC ATCAGGGCTG TTAGTGGCAC CTGTATGAGG CCAAGTGTGC 600 

GTCCAGGGGA ACAGCACAGG TTAATGCGTC TCCCTAG7UVC TCATGAAGTC AGTTTAATTC 660 

ATGCATGAAC ATGAGTTCAT TTTATGTTTT ATATAGCTTT CTTAGACATA CCAAACCATC 720 

ATTCATAAAT CAGATAAATT ATTCAGTTTT TGTGTTTAGA AAGCTAAGTA TGTGTAGCTG 780 

GAAACAAAAA TGAGCGTGTT TTCTCTCCTG TTAATCTAGA GTGTGCAGTT ACACATGTGT 840

GGATAATTTC ATGTTCCAGG GGCGCTTGGC ATCTCCCATG GACTGATTCC CAGGAAGAAA 900 

AGCCCAAAGG GAAACCCACG ATTCCTTTCG AOTAGATGTG GGAAAGAGCC CATTGGAGGA 960 

TATGAGGTCC TGTGAAATTC AGTTGTGTGT GTGGCTCCTT GTTAGCAGTC ATGTTGACAT 1020 

GGTGTTAGGA GGCTCCCCAT CCACCCTTTA CATGATGTAG GGACCAGTGT CTTGTGAGAT 1080 

TAACCTTGGG ACACAGTGGG TTAGCCTGGA GAAAATGAGA GGCCCTGCCT GGACCCAGGG 1140

AGAGGAGCCA GTGACACAGG CAGAGCGGTG CAGCCCTCCT TCCCTTCCAT TTGGAGGAGG 1200 

TGGTGCCAGG AGCCTGCCCG CTTACCTCTG CTGAAGCATA AGTGGACTTT GCmTGGGG 1260 

CTTATCTCTG ATACATGCTG GAGCCCTGCC TCTCCACTGC TAGATGGAAC CTGGAATCTC 1320 

TCATCTACCT CTTAGTCTGT CAGTTTCTAC GTGTGAGAAG CAAGCTTGTG GGCCAGTGTC 1380 

CTTGTACATG CTGTAGCACT TAAAAAATAA TTCCAGGGTT CCCTGGAAAA CCAGTCCCAG 1440

GGTTCCTATG ATCTGTAGTT TCTACCTGGA TTATAACTGG TTTTGGGTAC CTGAATTTTG 1500 

ATTGGTTAGC CTTAATTATA GTCTGGCGTG ATCATOTAGA ATCTTTTCTG GTGAACAGAT 1560 

CATAAAGTTC TATCAAGGAG TTCTATCAAG GCATCCATGT CAGTGGTGCT ATGCTGGTTA 1620 

CAACTTGAGA TTTTTGAAAT AAAAAATTTG TCATAAAAAA AAAAAAAAA 1669 

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CACTTTTGCT ATATAACCTA AGTGATAACC CTCTTTTAGT TACCTGCCAA ACTCTGGNCT 60 

TGGTTTATAT TGCAGTTAAC ACAGTTACAA AGCTGTAATG GTGTCTTTTT TTCCTTTGTA 120

ACGGAATGTG TAAATCAAAG TATATACATT GTGTGGTGTT CCTGTTTCTG GAGTTTCATG 180 

AGGATTTACA CATGGCATTC AGTGTTCTGT ATAGATCTGC CTACCTTTGT GAATTCATCT 240 

GTTAACCCCT CTTCCTTTGA GAGAGCACCG GCGATGGTGG TTAACTCCTT GTGTTTTCTC 300 

TCTCTCCTAC TGGTTATTCT TGAATTAAGC ACAGACTCGT CAGCTCGGTT GCTTTATCAT 360 

GAATAATGTG TGTGACCTTG CAGTTCTTCC ACAGTTCAGC AAACAAGTGC TAGCTTCACT 420

GACCAAAAAT TAAGGAAGGA AAACACAGTT TTTAAAACGA TCCATCTTTT AACAGCCGAA 480 

ACCGATGTGT CTATGGTGCT GCACCTTGCT GTTGTACTTC TGAAATCAGA CGTGTGTGAA 540 

CGATCATTTC TGACTTAACC GTGAGATGCT CACGAGTACC CTTCCTGTTG TTTTGTTAGC 600 

ATTGAAATCG AGACTATTTA TTTGGAATAT ATACAACAGT GTTTTTCCAC TGTATTTCAT 660 

TTGCAAAAGT TGAGAACTGC TTTCTCTACC TTTTGCAAAA TAATTGATAT TCCATATTGG 720

ATTCTCAAAG ACTTCGATAT GGTGAACCTA TTAAACCTAG AAATTGTATT CATCCTTTCA 780 

TGACTGTGGC CTGAGTTCCC CAGCCCCTCT CCTCCTTTTT 1TTAGATGAG ATTTAGCACA 840 

CTCTCAGTTA TTTAAACATG CAACATTTCT TGAGTATGTA TGTTGAGGCC ATCTGAGCTC 900 

ATAGCTGATT CAGTAACCAG TTTCATGCTG TGTCATTCAC ACTCACTACT TAATACTGCC 960 

ATGGTGAAAA TGTQGAGGAA AAATGTATCC ATGTGTGTCT GGGAAGCATA TACACTTGTA 1020

CATTTTTTAA TACTCTGATT CTGTAACATT TCTGAGTTTT GTTTTGTTTT ACAGNAAAAA 1080 

AAAAAAAAGT GATAAAGCAA TCAGAAGACC AAGAGGTITA CTATTGATGC TTAGGGTCGT 1140 

CTGACCTTGG CTGGCCAATA GACCTACACG GCCAAATTAA TTTACGAGAG TAATAATTTT 1200 

TCAAAAGCCA ATTTTTTTTC TGTATTTTCT GTATGAAACT GCCAATATCA TGAATAGAAA 1260 

GGGAGAACCA TAAAGGAGAA AGAACGTGAT GTTCTGTTAT GTTCATGTAA ACCTAAAGAA 1320

ACAGTGTGGA GGCAGGCGCG ATCAGCCGAA CTCTAGGGAC TTGGTGTTGC TTGGAAGGCA 1380 

TCCATACCTG CATTTTGCAT TCTTCGTATG TAATCATATT GCCAAAGACA AACTATTTCA 1440 

TCATTTATTG TAAATAACAC TTTTCCCCAG ACCTACCATA AAGTTTCTGT GATGTATTGT 1500

CTTCCAGTTG CAATAAAAAT TACTGAGTTG CATCAATTGA AGAAAAAAAA AAAAAAAAAA 1560 

CTCGA 1565



(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2007 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TCTAAAAGCC CCCTTATACC CCACTTTGTG CAGCAAAGAT CCCCGTGCAG GTCACAGCCT 60

GATTTGTGGC CAGGCTGGAC AAATTCCTGA GGCACAACTT GGCTTCAGTT CAGATTTCAA 120 

GCTGTGTTGG TGTTGGGACC AGCAGAAGGC AAACGTCCAG CCAACACACA GGACTGTAAG 180 

AGGACTCTGA GCTACGTGCC CTGTGAAGAC CCCCAGGCTT TGTCATAGGA GGTCGTTCAG 240 

CTTCCCCAAA GTCAGAGGTG ATTTGATTTG GGGAAGACTG AATATTCACA CCTAAGTCGT 300 

GAGCATATCC TGAGTTTTAC TTCCTTATGG CTTGCCCTCC AAGTTCTCTC TCTCATACAC 360

ACACACACCC TTGCTCCAGA ATCACCAGAC ACCTCCATGG CTCCAGCTAT GGGAT^CAGCT 420 

GCATTGGGGC TGCCTTTCTG TTTGGCTTAG GAACTTCTGT GCTTCTTGTG GCTCCACTCG 480 

CGAC3GCAGCT CGGAGGTGTG GACTCCGATT GGGCTGCAGG CAGCTCTGQG ACGGCACAQG 540 



GCGGGCGCTC TGATCAGCTC GTGTAAAACA CACCGTCTTC TTGGCCTCCT GGCAGTTCTT 600 

TCTGCGAATA GTCCTCTCCC TGGCCAGTTG AATGGGGGAA GCTGCTGGCA CAGGAAGGAG 660

AGGCGATCCC GGCTGAGGCT TAGGAAATTG CTGGAGCCGG CTCCAAGCAG ATAATTCACT 720 

GGGGAGGTTT TCAGAGTCAA ACATCATTCT GCCTGTKTTG GGGGCCAGGT GTGTCACACA 780 

ACCATCTCAA AGTCAAAAGC CATCTGGGGC TGCTGCTTCT CTTTCTCAGG CTCTGGGGAA 840 

AGGAATCTCC CTCTCCTCTC ACTTGATTCC AAGTGTGGTT GAATTGTCTG GAGCACTGGG 900 

ACTTTTTTTC TCTTTTCCTT GATQGACCAA CAGTGCAAAT GCAATCTCGC CATTTAACTT 960

TCAGGTCGAT TTCCTTTCCT GATCAGACAT CTTTGTGCCC CCTTTAGGAA GGAAAAGAAT 1020 

ACACCTACGA TGTGCCAGGC ACTGTGTTAG GCGCTTTTAT ATAGATCCTC GTTAGGATGA 1080 

GACTAAQGGA TGAGGACATC TCTTTATAAA AGGCCCCTAA GTAATGGATA AACAGAAACA 1140 

CTTAGAGGTG AGAAGGTCTG TCTTCAAGAT CCAAGGTAAG ATTGCCTTCA GTCTGATGTT 1200 

TGTTCTCAAG GACTTATCCC CTACAATATT CTCCCACTCC ATACTTCTCC TTCTACCCCA 1260

CCATGTGCTC CCGTGCACTC CTCAGATGGT CAGAGGGGTA ACCCAAGTCC TTAGAGAATT 1320

TGGGGACCAA TAGAATATGT GATGTGTGAA TTTTCTTTAA AAAACTTAAG GAGTCTTTGC 1380 

TACCTTCTGC TTGTTGAGTT GTTTTGGCAT TCATATTAAA AGCCAGCATC TCACTATTTA 1440 

TTGACAGGTT GGGCTGTGTG TGTGCGCATG TGTGTATACA TTTCCAGGCG TGCCTGTGTC 1500 

CTGTAGCTTT TTAAAAGGAA ACCC7VGTCAT CXTCACTATGA ATCTGGCATC TTCTTATGCT 1560

TCTAGTGTTT TGGCCATACA TCAACCAAGG GGTTTAATTT ATCCAATGCT TGACGACATG 1620 

TTCAGGAGGG GCTGGATCAA ATTTTGAGAG GGTTATGGGA AAGOGAGGGG GAGAAGAAAT 1680 

TGACATTTAT TTTATTATTT ATTTTAAATG TTTACATCTT CTTTATGTTG TATCAAGCCT 1740 

GAATAGAAAC TGATAGCATT AAAATACTCC GTTCCTCTCT CTCTTCTCGC TTCCTTTTTT 1800 

TTTTTTTTTA AATTTAGGAT AACACATTTT TGTTTCTAAA GTGATTTGTG ATTTGTGCTG 1860

TATAAACTGT ATAAAAGGTT CTGTTTTTAA AGGTGGATTT TCATTCCTCT GGGGACAGTG 1920 

GTCGCCAAGA CATCTACATT GTAAGAGAAC ACAGTGGAAG ATCCTGTCCT GATTCTCAAA 1980 

AATTATTTTC TCTGTATGAT TAAAAGT 2007 



(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CTTTTAACCC TCCCCCTTCA CACACATACA TATCAGGTTG TTTTCTAGTT AAAAACCCAA 60 

GTAGCTCAGA TTCTACTTTA ATGTCAGTGC AGATTTGCAT TGAATCATGC CATTATGTTT 120 

TTTCTCATTT TTATGCTGTT GGGTCTTAGT TTTTAAATTG ATATAAAGAA CTCAGCAATG 180

GTITTATTTT CTACTCATAC TTAGGGTTTA GGAAACACTA CCACTAGTTA TCATTTAATC 240 

AACTTCAATG GTCTACTGAA ACAAAAATGG TAACTTTTCA TTAGTGGATT ATTTAGAGTT 300 

ATAGTAGTTG TTTCCAGAAA ACACTTCCTC ACAATTGTAC TTCCCAATCA AATCATGTGA 360 

TCATACAGTT ATTCCCATGA AAGGCAGAAT GTTTGTTTCA AAATTAATCT AGTTTTCTGT 420 

ACATTTAAAT TTGAGAAGGT GACAACTQGC TCmTCCAG TCTTCCTTCA TGTCAGTTTT 480

CTGATAGACC ACTATTGGCA AACAGTATCT GTCAACTACC AAATGTGTAA AATTTTCTGT 540 

ATTTCACTTT GTCTTATTTG TAAATAGTGA ACTAAAACTT TTGGCAGATC AGCAACATTT 600 

GCTGAGCCTG TTTTTTAAGC TAATGTGTAT TCTTACTAAT GTTCCTATCA AGAATGGA1T 660

TGTAATATAT GCTGTCTATT TCTAATGTTC ACATTCATAT TTTGAGGTTC TATCTTATTT 720

TAATAGAGAA CAGACTTCTC AAAAAATCTT CAGAAGCAGC TTATTATTGA AATATCGAAA 780

TATTGAAATA AACCCGGTGG GTTAGATTAC TCATCTGTCC ACCAAGTGGG ACATTTGCAT 840

GGACTGGGGG CTTAAAGGAC TTAGAAGAGA CCTGTAAGTA AATCCTGAAA ATGAGCCAAT 900

CCCCACTTGA ATGGTTACTG GAGTAAACCC ACCTTTACCA CCCCAATTAC AGCACCCGAG 960

GCCGATAAAC CAACTTGGCT CTGGTTCATT TTTCTTTTCT TCATTTCTGA TGCTCAGATT 1020

CAAAATGTGT GTTCTACACT GTTACAGGCT TCTCTTTTGT TTGATTAAAG ATTTTAGTCC 1080

TACTTTTGTA TGGACACATT AGAATATTCA GAGACCAAAA TAGAAGAATT TGCTGTTAGA 1140

TATTTTTCAG AAGTCAGCAG ATTTGTGGCA AATCATTTAT TTGCCTTTTT AAAAATTCAT 1200

TTAAGCAGTT CAGAGAGTAG ACTACTCAGA AAATTATTTC ACGTAATTGT CTAAGAGGTC 1260

AATATTTTTT AATGCATATT GAATCAAATA A 1291



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

GGCACGAGGA CCTACTrTTG TAACAGACCA TGGTTGTGTC CAAGGTAAAA CCACAGTGAT 60 

ATrTTTGGAT GCTITGTCTG CAATCTTGAC TTGTTTTTGC AGTATCATTA TTCAGACTTC 120 

AAATTGTGAA TCTTTTAAAC ATCTTGATAA TTTGTTGTTG AGAGCTGTTC ATTCTAAAAT 180 

GTAATGAAAT TCAGTCTAGT TCTGCTGATA AAGATCATCA GTTTTGAAAG GTTACTGATT 240 

TTCCTCTTCC CTCTTAGTTT TTTACCCAAT ATATGGAGAA GAGTAATGGT CAATCTTAAC 300 

ATTTTGTTTT AATTGTTTAA TAAAGCTGCT GGGCAGTGGT GCAGCATTCC TACCTAGTGT 360 

CATAAAAGCA AAATACTTAC ATAGCTTTCT TAAAATATAG GAATGACATT ACATTTTTAG 420 

GAGAAAGTAA GTTQCTTTGC ACCGCCTACT TAATTCCTTT CCATATATTG TGATACAAAC 480 

TTTTGAATAT GGAATCTTAC TATTTGAATA GAAATGTGTA TGTATAATAT ACATACATAC 540 

ATAAGCATAT ATGTGTGTGT GTGTGTGTAT ATATATATAT ATGCATGCTG TGAAACTTGA 600 

CTACACAACA TAAATCACTT TTTAAATTCC AGGAACGGGT AGTCTGACAC GGTGATTATC 660 

CTTTTGAGGC TGAATCCGTT ATTAACTTGT TATTTAGGTT rPACTCCCAG TAGCAAGGGA 720 
TTCTAAGTTA GTTGCACTTA CATGATTATT GTGATTTAAA ACTAAGAATA AAGGCTGCAT 780

TTTCAAAGAT AAATTGGAAT TGCTGTTGGT GAAATAACAA CCAAAATACT GAATCTGATG 840 

TACATACAGG TTTCTACAGG AAGAGATGGT ATAATTTACA ATTTGGAGAT TTAATAACCA 900 

GGGCTACCCA GAAAAAGTGA CTTGATAACA TGGTACCAAT AAGTAAGGGA TGCTCTCTCG 960 

GTTTGCTTTT GCCACTTTCA AGATTTTAAC TTCTCAGGTT ATTAATCAAA ATTATTGTAT 1020 

AAGTTAGCCA ATAGAATTTT TAGGTTAAAA CAACAGATGG GGGGTTTGTG GAGTGTTTAA 1080 

TGTCATGGGC ATTTTTAGTA GCATAGACCC TTTGTTCTGC ATTTGAATGT TTCGTATATT 1140 

TTTGTTTCAC AGTTAATCTT CCCTCCCCAA GTTTGCTATT CAAATCAACT GCCTGAATGA 1200 

CATTTCTAGT AGTCTGATGT ATTTTTCTGA GGAATAGTTT GTGATTCCAA TGCAGGTGTC 1260 

TTCATTACCA TTACCTCTAC ACTGCAGAAG AAGCAAAACT CCTTTATTAG AATTACTGCA 1320 

CATGTGTATG GGGAAAATAG TTCTGAAAGG CTAGAATGAT ACAAGTGAGC AAAAGTTGGT 1380 

CAGCTTQGCT ATGGAGTGGT GGCAATAATC TCTAAACATT CCAAAAGACC ATGAGCTGAA 1440 

CCTAAACTCC CTTGGGAATC TGGAACAAAG GAATATGAAA ATTGCCATTT GAAAACTQAC 1500 

CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCCATTT CAAGTACAGA 1560 

AATTATAGAG ACTACAGCTA AATAAATTTG AACATTAAAT ATAATTTTAC CACTTTTTGT 1620 

CTTTATAAGC ATATTTGTAA ACTCAGAACT GAGCAGAAGT GACTTTACTT TCTCAAGTTr 1680 

GATACTGAGT TGACTGTTCC CTTATCCCTC ACCCTTCCCC TTCCCTTTCC TAAGGCAATA 1740 

GTGCACAACT TAGGTTATTT TTGCTTCCGA ATTTGAATGA AAAACTTAAT GCCATGGATT 1800 

ri T l ^lXJ^i'lT i * GCAAGACACC TGTTTATCAT CTTGTTTAAA TGTAAATGTC CCCTTATGCT 1860 

TTTGAAATAA ATTTCCTTTT GTAAAAAAAA AAAAAAAAAA AAAAAA 1906 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

TCTGAACTAA TGCTAACAGA TCCCCCTGAG GGATTCTTGA TGGGCTGAGC AGCTGGCTGG 60 

AGCTAGTACT GACTGACATT CATTGTGATG AGGGCAGCTT TCTGGTACAG GATTCTAAGC 120 

TCTATGTTTT ATATACATTT TCATCTGTAC TTGCACCTCA CTTTACACAA GAGGAAACTA 180 

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TGCAAAGTTA GCTGGATCGC TCAAGGTCAC TTAGGTAAGT TGGCAAGTCC ATGCTTCCCA 240 

CTCAGCTCCT CAGGTCAGCA AGTCTACTTC TCTGCCTATT TTGTATACTC TCTTTAATAT 300 

GTGCCTAGCT TTGGAAAGTC TAGAATGGGT CCCTGGTGCY TTTTTACTTT GAAGAAATCA 360 

GTTTCTGCCT CTTTTTGGAA AAGAAAACAA AGTGCAATTG T X T ITX ACTG GAAAGTTACC 420 

CAATAGCATG AGGTGAACAG GACGTAGTTN AGGCCTTCCT GTAAACAGAA AATCATATCA 480 

AAACACTATC TTCCCATCTG TTTCTCAATG CCTGCTACTT CTTGTAGATA TTTCATTTCA 540 

GGAGAGCAGC AGTTAAACCC GTGGATTTTG TAGTTAGGAA CCTGGGKTCA AACCCTCTTC 600 

CACTAATTGG CTATGTCTCT GGACAAGTTT ■ ri ' l " l ' X '' i " i ' l ' i ' r TTTTTTTTAA ACCCTTTCTC 660 

AACnrCACT TTCTATGTCT ACCTCAAAGA ATTGTTGTGA GGCTTGAGAT AATGCATTTC 720 

TAAAGGGTCT GCCAGATAGG AAGATGCTAG TTATGGATTT ACAAGGTTGT TAAGGCTCTA 780 

AGAGTCTAAA ACCTACAGTG AATCACAATG CATTTACCCC CACTGACTTG GACATAAGTC 840 

AAAACTAGCC AGAAGTCTCT TTTTCAAATT ACTTACAGGT TATTCAATAT AAAATTTTTG 900 

TAATGGATAA TCTTATTTAT CTAAACTAAA GCTTCCTGTT TATACACACT CCTGTTATTC 960 

TGGGATAAGA TAAATGACCA CAGTACCTTA ATTTCTAGGT GGGTGCCTGT GATGGTTCAT 1020 

TGTAGGTAAG GACATTTTCT YTTTTTCAGC AGCTGTGTAG GTCCAGAGCC TCTGGGAGAG 1080 

GAGGGGGGTA GCATGCACCC AGCAGGGGAC TGAACTGGGA AACTCAAGGT TCTTTTTACT 1140 

GTGGGGTAGT GAGCTGCCTT TCTGTGATCG GTrTCCXTTAG GGATGTTGCT GTTCCCCTCC 1200 

TTGCTATTCG CAGCTACATA CAACGTGGCC AACCCX:AGTA GGCTGATCCT ATATATCATC 1260 
AGTGCTGGTG GTCAGTGTGA ATAGeeceAC CCAAGCIXSGC TATAGGTTTA XAGATACATT 1320 

M.TT?sGGChA CCTAAAATAT TGATGCTGGT GTTGGTGTGA CATAATGCTA TGGCCAGAAC 1380 

TGAAACTTAG AGTTATAATT CATGTATTAG GGTTCTCCAG AGQGACAGAA TTAGTAGGAT 1440 

ATATGTATAT ATGAAAGGGA GGTTATTAGG GAGAACTGGC TCCCACAGTT AGAAGGCGAA 1500 

GTCGCACAAT AGGCCGTCTG CAAGCTGGGT TAGAGAGAAG CX:AGTAGTCG CTCAGCCTGA 1560 

GTTCAAAAAC CTCAAAACTG GGGAAGCTGA CAGTGCAGCC AGCCTTCAGT CTGTGGCCAA 1620 

AGGCCAAGAG CCXrCTGGCAA CCAACCCACT GGTGCAAGTC CTAGATTCCA AAGGCTGAAG 1680 

AACCTGGAGT CTGATGTCCA AGAGCAGGAA GAGTGGAAGA AAGCCAGAAG ACTCAGCAAA 1740 

CAAGGTAGAC AGTGTCTACC ACCAYAGTGG CCATACCAAA GAGGCTACCG ATTCCTTCCT 1800 

GCTACCTGGA TCCCTGAAGT TGCCCTGGTC TCTGCACCTT CTA7ACCTAG TTCTTAAGAG 1860 

CTTTCCATTA CATGAGCTGT CTCAAAGCCC TCCAAOVIAAT TCTCAGTGTA AGYTTCAAAA 1920 

AAAAAAAAAA AAAAA 1935 
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(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

NGCCCCCTTG GCACAAGTCA GATGAAGCAC GTTCTGCCGG GGAGGCCCTC AMCTTCCAGA 60 

GAGGACAGAC ACAGATTTCC TGCTGGGGGA GGGAGGAGTC CACGCATCCT GATGCTGCCT 120 

GGAAGCTTAT TTTCCCGTGG CCAGGATGCA TTTCTCTGAG TGGAAACAGG TTCTTGCATG 180 

TGGATGTGTG TTTCCCCAGG CAGACGGCCC CTCTYTTCCC AGCACTTCCC TGCCTCCCCC 240 

AGGCCTCAGG CCAGCACCCA GTTCCTCCTC ACATGGCAGG TGAGCACAGA CTTCTAGTTG 300 

GCAGGAGCTG AGGAGGGTGA ACAAACCCCG AGGGAGGCCC GGCCCTTGCT CCCGAGTTGG 360 

GGGGAGGGGG TGTGGCAACG TGCCCCCCGC AGAGGCCACG CATGTTTGAC CAAAGCCCTC 420 

ATTGTGGTCC GAGGACAGCC TTTTCCCCAG GCCTCARAGC ATTGCTCATC CGTGCCAAAC 480 

TGGGTAGGTG GATTTGAGCG GAAAGACTCC CAAAATGTGC CAAGAATTTC CCRGTCCCAG 540 

GCAGGGCAGG GGAAACTAAG GGCAAGCAGG ATACAGGGCG AGGGATGTQG CAGGTGAGGG 600 

GGCTCCCGCC TGTGCCCCTT CTCCTCACCA TGTCTCCCCC ACCCTGCCTC AGTTCTCCGT 660 

TCCCCTTCAT CTCCGTCCCC CTCTTTGAAG CTGTCCCCAT CTCAGTGTCA GACCAGCCTT 720 



CTCCTCAKCT GACCACCCTC CTCTGACCSA CGCCCCCTCC TTGTCTGAAA AAAGGAGCCT 780 

TGAATGGTGG AGGGAGGCAG TGGGGAGAAA GGTCTCACCG GACAGGTTGG GAGAATGAGG 840 

TCAGCGGTGC TGGGGAACAG ATGGAGGGGG CAGTGGGGAC AGGGCTTGGG CAGACACCAG 900 

CAGGAATAAT TTGAAATGTG TGAGGTGACT CCCCGGAGGC CTTGGGCTTG GGCATTTGGG 960 

AAAAGAATGA TGTCTGGAAG GGCTTAAGGG ACACAGTGGA CGAGGGGAGA GTCCTCATCT 1020 

GCTGGCATTT TGTGGGGTGT TAGTGCCAAA CTTGAATAGG GGCTGGGGTG CTGTCTTCCA 1080 

CTGACACCCA AATCCAGAAT CCCTGGTCTT GAGTCCCCAG AACTTTGCCT CTTGACTGTC 1140 

CCTTCTCTTC CTACCTCCAT CCATGGAAAA 1TAGTTATTT TCTGATCCTT TCCCCTGCCT 1200 

GGTCTAGCTC CTCTCCAAAC AGCCATGCCC TCCAAATGCT AGAGACCTGG GCCCTGAAOC 1260 

CTGTAGACAG ATGCCCTCAG AATTGGGGCA TGGGAGGGGG GSTGGGGGAC CCCATGATTC 1320 

AGCCACGGAC TCCAATGCCC AGCTCCTCTC CCCAAAACAA TCCCGACAAT CCCTTATCCC 1380 

TACCCCAACC CTTTGCGGCT CTGTACACAT TTTTAAACCT GGCAAAAGAT GAAGAGAATA 1440 
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(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TTTTTTTTTT TTTGATATGA AATTGTdTT CTCCATTGCA GAAATAAGCT AGGGAAACAC 60 

TAACCCAAAA ACTTTCTGTA GAGCTGTTCC TTTGGAGGCA GCATCACTTA TTGGCAGTAA 120 

AGACTCAGTA TAAAAGCACC AGCATCCCTA CTTGGGTGAT GGGGATTAAT TTTATAGCAT 180 

TCCATTTTCC TAGTGCCACA TGTGAAATTG GATTTTGATG ATCTTAATCT ATATTCTACC 240 

dTATAATAA AAGATCAAAA GATATATCTC CTATGAACAG ATTGGAGATA GGAGATGAAA 300 

AGTTGGGAGG ATGTCTTTAT TCTAATGTGA GGGTAGGGAA AATGTGGATA ACATTACTGG 360 

GGTGARQGAG GCATTGTTCT TTAGTTGGAG TTCTCATTTT TATTCTCCAG TACTGACTTG 420 

TGGGGAAAGC ATACTTTTTC ACTGCCAGGT ACTGAATGCA GAGGCTCAGT GAAGTATATA 480 

TGTGGGAAGT GCATGCATTT CGTTTATTAG CAAACATAGC TGGATTAAGA CAAAGTTGTT 540 

GGTTTGGAAA GGGGTTAAAG CCTTAAGTGA ACAAATCTAG CTAACAGTGA ATGAACTAGG 600 

TAATATAACT TGCATATTTT TAATTTCCTT TGGTTAAAX3G TCCCCCATAC TTCTCTGTTC 660 

GGAGACATGA GAAGTATGAT TACTTCAGTG TTAGTTTTCT TAATTmTT TTTCCCCTAT 720 

TTGTCCCTTG TCACTTTGTT GCAAGCTAGA AATCTGTGGG TTATACATAG GGCAGCTCTT 780 

TGTGAAAGTG GTTTATTCCA CTGGAGAAAG GGGATTGAAA ATCAGTTAGA ACCAATGTAT 840 

TTCTTGCCCC ACGGAACACT ATTCCTATAA GATAGCTGAA AGAAGCTGCT GTGAGGAGCT 900 

CAGCTCCAAA CACAGGATCA GCACCTTGTA TAGGAATTCC CATGAATTAT GACTTCTCAT 960 

TCTGTTTTAT CAGAGTGCAT ATATGTCCTA CTTCAGGAAA AGTAAAACAG TCATTTACGA 1020 

AAGAAAGTCA ATCTGTATCC TAAGCATTTT AATAAAAAGT TAAAACAAAA AATTAAAAGG 1080 

GACACTCGAG GGGGGGCCCG AAACCCAAT 1109 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

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(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

TAGGACTAAC TTAAATTCTT TTATTCATCT TTTATTTATT AAAAAATTTT ATTTCTTTGA 60 

ATTTTCCTGT AATTTCCTTA RGCTCTTCTA TAAAATGTTA TATTCATGTG AACCATAOCT 120 

CATTATCCTT AACATTTACT CTCAAAAAGC TTTTTATTTT TATTTTTTTG AAGGTAGTTT 180 

TTCTGTGTGT ACTCTGTAAC ATGATTTTGC TTTCAAATCA TTGTTGTGCC CCCAT/^CAAA 240 

ATGCCTTTTA TTTTTGAGGA TCGTGGACTT TTTAGTATGG CATGAGTGTG CTAAAAGCCA 300 

GATATCTTTC CACATTCACT GGTGGCTTTG ACACCTAGTT TTTAATCTCC CATCCTTACT 360 

TTAAACCCTG ACAGTGCAGT CCTCAGTCAG GGCCAGGACC GGGCTGAGGC CCTTTGTGGA 420 

GATGCTGCAC CACCAGCAGA AGGCTGAGAC CTGGTTACCT GTACCTGTTC ACTTGTAATA 480 

AAAAGAATTA TCTAAAA 497 

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(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

ATGAGGCAGA GGCAAGCTGC CTGCCAACCC CCTCCCTCAA GGAATGGCCT TGCCCAGGAA 60 
TGCCCACCAC ACATACCCTC TTCTITnTT CTAGTCAAAC TCTTGTTTAT TCCTTGGCTT 120 
GCCTCCCTCC TTTCCTCCCC TCTCAACCTT TTACTTCTGG TTTCTATTTC ATGGGATTTG 180 
GGGTTGAAGT TAAACTTACA ACAGTGCCGC CAACACCAAG TCTTGCAGGA AAAAAATACA 240 
AAGAAATTTA ACAAAAAAAA AAAAAAAAA 269 



(2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

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TTGATTGACT ATGGTCTCTC CGGCTACCAG GAAGAGTCTG CCGAAGTGAA GGCCATGGAC 60 

TTCATCACCT CCACAGCCAT CCTGCCCCTG CTGTTCGGCT GCCTGGGCGT CTTCGGCCTC 120 

TTCCGGCTGC TGCAGTGGGT GCGCGGGAAG GCCTACXTTGC GGAATGCTGT GGTGGTGATC 180 

ACAGGCGCCA CCTCAGGGCT GGGCAAAGAA TGTGCAAAAG TCTTCTATGC TGCGGGTGCT 240 

AAACTGGTGC TCTGTGGCCG GAATGGTGGG GCCCTAGAAG AGCTCATCAG AGAACTCACC 300 

GCTTCTCATG CCACCAAGGT GCAGACACAC AAGCCTTACT TGGTGACCTT CX3ACCTCACA 360 

GACTCTGGGG CCATAGTTGC AGCAGCAGCT GAGATCCTGC AGTGCTTTGG CTATGTCGAC 420 

ATACTTGTCA ACAATGCTGG GATCAGCTAC CX3TGGTACCA TCATGGACAC CACAGTQGAT 480 

GTGGACAAGA GGGTCATGGA GACAAACTAC TTTGGCCCAG TTGCTCTAAC GAAAGCACTC 540 

CTGCCXrrCCA TGATCAAGAG GAGGCAAGGC CACATTGTCG CCATCAGCAG CATCCAGGGC 600 

AAGATGAGCA TTCCTTTTCG ATCAGCATAT GCAGCCTCCA AGCACGCAAC CCAGGCTTTC 660 

TTTGACTGTC TGCGTGCCGA GATGGAACAG TATGAAATTG AGGTGACCGT CATCAGCCCC 720 

GGCTACATCC ACACCAACCT CTCTGTAAAT GCCATCACCX3 CGGATGGATC TAGGTATGGA 780 

GTTATGGACA CCACCACAGC CCAGGGCCGA AGCCCTGTGG AGGTGGCCCA GGATGTTCTT 840 

GCTGCTGTGG GGAAGAAGAA GAAAGATGTG ATCCTGGCTG ACTTACTGCC TTCCTTGGCT 900 

GTTTATCTTC GAACTCTGGC TCCTGGGCTC TTCTTCAGCC TCATGCCTCC AGGGCCAGAA 960 

AAGAGCGGAA ATCCAAGAAC TCCTAGTACT CTGACCAGCC AGGGCCAGGG CAGAGAAGCA 1020 

GCACTCTTAG GCTTGCTTAC TCTACAAGGG ACAGTTGCAT TTGTTGAGAC TTTAATGGAG 1080 

ATTTGTCTCA CAAGTGGGAA AGACTGAAGA AACACATCTC GTGCAGATCT GCTGGCAGAG 1140 

GACAATCAAA AACX3ACAACA AGCTTCTTCC CAGGGTGAGG GGAAACACTT AAGGAATAAA 1200 

TATGGAGCTG GGGTTTAACA CTAAAAACTA GAAATAAACA TCTCAAACAG TAAAAAAAAA 1260 

AAAAAAAAC 1269 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1944 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
AAAAGGCAAA CTATAGGATA ACACAGAGCC dTTTTGAAA ATAAATTGGC ATTGGAGTGT 60 



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TTTACCCTCT AGCTGTTTTA dTAGAATGT AACATATGCT GCCTACCCAC CTCAAAATGT 120 

CTGTACTGCA AGAGGGCCCT GGGCCTCTGC TTTCCATATT CACGTTTGGC CAGAGTTGTA 180 

GTCCCAAAGA AGAGCATGGG TGGCAGATGG TAGGGAATTG AACTGGCCTG TGCAATGGGC 240 

ATGGAGCACA AGGGGTCACA GCATGCCTCC TGCCTTACCG TGGCAGTACG GAGACAGTCC 300 

AGAACATGGT CTTCTTGCCA CGGGGTGTTG TTGTCTCTGG TGGTGCTGCA TGTCTGTGGC 360 

TCACCTTTAT TCTTGAAACT GAGGTTTACC TGGATCTGGC TACTGAGGCT AGAGCCCACA 420 

GCAGAATGGG GTTGGGCCTG TGGCCCCCAA ACTAGGGGGT GTGGGTTCAT CACAGTGTTG 480 

CCTTTTGTCT CCTAAAGATA GGGATCTACT TTTGAAGGGA ATTGTTCCTC CCAAATAAAT 540 

TTGCTTTACC TTGGTCCTTT CTTTTGTGCC AGTATTCAAG TGGTATAGCT CTGAGCAGGG 600 

TCACATTTGG CCAAACCTGA CACTGTCTTG CTGCATTCTC CTTTGGCAAA CATCAGGGTC 660 

AGAATTCAGG ATAGCCCTTC CTAGGGCACT GGACTTTCTG GCATGGGGGC TGTGTTTGCA 720 

CAAGTTATTT TCATGTTACC TGGAGAGTGT CCAGAGGCTG CTCTGAGGCT GAGGTGTGTT 780 

CCCCCTTGCC TGGTTCCAGC TGTCAGAGGG ATACCATCCT AGGGTCTGGG AATCCAAGGC 840 

CACGAGACTC CTTGGTTTGT GGTCCGAGAT CCTGTACTAA GGAGGGTCTG GCCAGAGGAA 900 

CAGACCAGCT TTTGCACAAT GAAGCGCAAG GGAACAAGTG GTTTGCCTGG TGTCCTACCT 960 

GTCCTGAACC TGGTCCTGTG GGCCATTGAA AAGTTAGATC TGTGATCTCT GGGGTTTTTG 1020 

TGGCTITGTT CAATGCTTCC ACTCTAQGGC AGGCAGAGCA GTCTATACTC TCCCAAGCCT 1080 

GCTTGACCTC CAAGTAGAGC TGATACAGAG ATCTGTGAAT ATTGTGATAG AAATTCTTTG 1140 

GTAOTCATAC: ATTTC^SCIG'cAA^^ AlT:TCCCAOT~TAja^ 1200 ~ 

AGTCATTCTT AAAGACAGAG GATAGCTGTG ACTCATGGGA TCATGAQGTC CATGGCTGGT 1260 

TGCAGGTTCC CTTTTTCCTT CXnCAGGTTT TGTCTCITCC TGTGTTGTCC CCAGCAAGGG 1320 

AGAGACTGTG GGGTGGATTG GGAGAACAGA TTAGGAGTAT AGCAAATGAA CCCAGAATGG 1380 

AACAGTGGGG AGCTAACTGT GAATGAGGAG AGTACCTGCT GCAGGACCTG QAGGTCAGGT 1440 

GTGAATGCTG TATTGGCACA GGGAATAAAT ATCCTGGCX3T CTGGAGCCTT CACCTCTCCG 1500 

TCAAGTCCTT CCTGTGATAC TGCCATCGCA CAGGATCTGA GTTGCAGCTC TGCACCCTAA 1560 

ATCACACCCT GGGCATTGTC TGGGCTGCAG GGCTGCCAGG TTCTGTACTT GTGTCCAGCT 1620 

GTGGCCCTGG ATGCTGGAGC TGGAGGGTTT TCTGTGCTCA GACTGTAGCC TGTAGCTCTT 1680 

GGCXrrGTGTA GAGCCCCCTC CTGTGCCCTC AGTGGCTGTC GTTTGTTAAC ATCATCAGGA 1740 

AGATGGGAAA GGTCAGGCAG AATITTTCTG CCCTACAAAG GGTGGAAGAG AAAGGACACA 1800 

GTATTTTCAT GAATTTACCA TATATCTTTG TTTTrCTTCA ACGAAAAAGT TAATTGAGGC 1860 
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AATGTCATCT GCTCAAAGTT GAGTGGTTTA TTCACAATAA ACTGTAAGTT TCTGATTATA 1920 
AAAAAAAAAA AAAAAAAAAA AAAG 1944 
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(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

TCGACCCACG CGTCCGGGGT GCGCAACGGG GAGTTCCGGC TGGAGACCCG TGCTCTGGGC 60 

CGGCGCCTTC ACCATGGCCT CGGCAGAGCT GGACTACACC ATCGAGATCC CGGATCAGCC 120 

CTGCTGQAGC CAGAAGAACA GCCCCAGCCC AGGTGGGAAG GAGGCAGAAA CTCGGCAGCC 180 

TGTGGTGATT CTYTTGGGCT GGGGTGGCTG CAAGGACAAG AACCTTGCCA AGTACAGTGC 240 

CATCTACCAC AAAAGGGGCT GCATCGTAAT CCGATACACA GCCCCGTGGC ACATGGTCTT 300 

CTTCTCCGAG TCACTGGGTA TCCCTTCACT TCGTGTTTTG GCCCAGAAGC TGCTCGAGCT 360 

GCTCTTTGAT TATGAGATTG AGAAGGAGCC CCTGCTCTTC CATCTCTTCA GCAACGGTGG 420 

CGTCATGCTG TACCGCTACG TGCTGGAGCT CCTGCAGACC CGTCGCTTCT GCCGCCTGCG 480 

TGTGGTGGGC ACCATCTTTG ACAGCGCTCC TGGTGACAGC AACCTQGTAG GGGCTCTGCG 540 

GGCCCTGGCA GCCATCCTGG AGCGCCGGGC CGCCATGCTG CGCCTGTTGC TGCTGGTGGC 600 

CTTTGCCCTG GTGGTCGTCC TGTTCCACGT CCTGCTTGCT CCCATCACAG CCNTCTTCCA 660 

CACCCACTTC TATGACAGGC TACAGGACGC GGGCTCTCGC TGGCCCGAGC TCTACCTCTA 720 

CTCGAGGGCT GACGAAGTAG TCCTGGCCAG AGACATAGAA CGCATGGTGG AGGCACGCCT 780 

GGCACGCCGG GTCCTGGCGC GTTCTGTGGA TTTCGTCTCA TCTGCACACG TCAGCCACCT 840 

CCGTGACTAC CCTACTTACT ACACAAGCCT CTGTGTCGAC TTCATGCGCA ACTGCGTCCG 900 

CTGCTGAGGC CATTGCTCCA TCTCACCTCT GCTCCAGAAA TAAATGCCTG ACACCTCCCC 960 

ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG GGGCCCGGTA CCCAATTCGC CCTATAAAGG 1020 

T 1021 



(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 base pairs 

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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

GGCACGAGGA GGGCCACC5GC AGCCATCGCG CTTTGCAGTT CGGTCTCCTG GTGTACGGCC 60 

AACGCCAAGT AGGQGATTGC GTTCCCTCCA GTCGCAGACC CTATCAGATT TGGATATC5TC 120 

CTTCATATTT GATTGGATTT ACAGTGGTTT CAGCAGTGTG CTACAGTTTT TAGGATTATA 180 

TAAGAAAACT GGTAAACTGG TATTTCTTGG ATTGGATAAT GCAGGAAAAA CAACATTCCT 240 

ACACATGCTA AAAGATGACA GACTTGGACA ACATGTCCCA ACATTACATC CCACTTCCGA 300 

AGAACTGACC ATTGCTGGCA TGACGTTTAC AACTTTTCAT CTGGGTCGAC ATGTTCAAGC 360 

TCGAAGAGTG TGGAAAAACT ACCTTCCTGC TATCAATGGC ATTGTATTTC TGGTGGATTG 420 

TGCAGACCAC GAAAGGCTGT TAGAGTCAAA AGAAGAACTT GATTCACTAA TGACAGATGA 480 

AACCATTGCT AATGTGCCTA TACTGATTCT TGGGAATAAG ATCGACAGAC CTGAAGCCAT 540 

CAGTGAAGAG AGGTTGCGAG AGATGTTTGG TTTATATGGT CAGACAACAG GAAAGGGGAG 600 

TATATCTCTG AAAGAACTGA ATGCCCGACC CTTAGAAGTT TTCATGTGTA GTGTGCTCAA 660 

AAGACAAGGT TACGGAGAAG GCTTCCGCTG GATGGCACAG TACATTGATT AACACAAACT 720 

CACATTGGTT CCAGGTCTCA ACGTTCAGGC TTACTCAGAG ATTTGATTGC TCAACATGCA 780 

TAACTTGAAT TCAATAGACT TTTGCTGGTT ATAAAACAGA TGTTTTTTAG ATTATTAATA 840 

TTAAATCAAC TTAATTTGAA TGAGAATTGA AAACTGATTC AAGTAAGTTT GAGTATCACA 900 

ATCTTAGCTT TCTAATTCCA TAAAAGTACT TGGTTTTTAC AGTTTATAAT CTGACATCAC 960 

CCCAGCGCCA TTTGTAAAGA GCAACTTTCC AGCAGTACAT TTGAAGCACT TTTTAACAAC 1020 

ATGAAACTAT AAACCATATT TAAAAGCTCA TCATGTTAAA TTTTTTATGr ACmTCTGG 1080 

AACTAGTTTT TAAATTTTAG ATTATATGTC CACCTATCKT AAGTGTACAG TTAATAATTA 1140 

GCTTATTCAA TGATTGCATG ATGCCTTACA GTTTTCAATA ACTTTTTTTC TTATGCAAAC 1200 

GTCATGCAAT AAAACAAACT CTAATGTTTG GCAAAAAAAA AAAAAAAAAA OTCGAGGGGG 1260 

GGCCCGTACC CAATTCGCCC TAAAG 1285 

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(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



WO 98/39448 



PCT/US98/04493 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGCACGAGGT GGCGCAGGGG TCAGTGGTTC TCTCGGGTCT CGGGACAGGT GAGCACCCTG 60 

ATGAAGGCCA CGGTCCTGAT GCGGCACCTG GGCGGGTGCA GGAGATCCTG GGCGCCCTCC 120 

GCAAGGGCGS CGGAGACCGG TTACAGGTGA TTTCTGATTT TRACATGACC TTGAGCAGGT 180 

TTGCATATAA TGGAAAGCGA TGCCCTTCTT CTTACAATAT TCTGGATAAT AGCAAGATCA 240 

TCAGTGAGGA GTGTCGGAAA GAGCTCACAG CGCTCCTTCA CCACTATTAC CCAATTGAGA 300 

TCGACCCACA CCGGACCGTC AAGGAGAAGC TACCTCATAT GGTGGAATGG TGGACCAAAG 360 

CGCACAATCT CCTATGTCAG CAGAAGATTC AGAAGTTTCA GATAGCCCAG GTGGTTAGAG 420 

AGTCCAATGC AATGCTCAGG GAGGGATATA AGACCTTCTT CAACACACTC TACCATAACA 480 

ACATTCCCCT TTTCATCTTT TCTGCGGGCA TTGGTGATAT CCTGGAAGAA ATTATCCGAC 540 

AGATGAAAGT GTTCCACCCC AACATCCACA TCGTGTCTAA CTACATGGAT TTTAATGAAG 600 

ATGGTTTTCT CCAGGGATTT AAGGGCCAGC TGATACACAC ATACAACAAG AACAGCTCTG 660 

TGTGTGAGAA CTSTGGTTAC TTCCAGCAAC TTGAGGGCAA AACCAATGTC ATCCTGCTGG 720 

GAGACTCTAT CGGGGACCTC ACCATGGCCG ATGGGGTTCC TGGTGTGCAG AACATTCTCA 780 

AAATTGGdT CCTGAATGAC AAGGTGGAGG AGCGGCGGGA NCGCTACATG GACTCCTATG 840 

ACATCGTGCT GGAGAAGGAC GAGACTCTGG ATGTGGTCAA CGGGCTACTG CAGCACATCC 900 

TGTGCCAGGG GGTCCAGCTG GAGATGCAAG GCCCCTGAAG GCGCAGGCTN CCAGNCCGCC 960 

TGCAGGCCGT GGTGAGGAGG GGCGCCTCCC CAGAGTCTGC TCCCCCGTGA ACACAGAGCA 1020 

GANGCCAGGG TGGCCAGCAG TGGCTGGGTC CTTCCGCGCC CCTCCGTCCT CCTTTCCCTG 1080 

AGCACCTTCA TCACCAGAGG CTTGAAGGAA CCCCGCCATG TGGCAGGGCA CAGGCACTGT 1140 

TCCTGGTGAA CCTTGGACCA CAGCATGTCA GTGCTCTAGG GATTGTCTAC TCCAGGGATT 1200 

TTCTTCAAAA TTnTAAACA TGGGAAGTTC AAACAAATAT AATGTGTGAA ACAGATCAAA 1260 

ATTTTTAAAA TGAAAAAAAA GCTGCTCTGA TTCAGGGGAT GTGGGTCGGG GTAGAACCTG 1320 

GACCTCTTGG CCTGGGGGCA CATGGGATGC TTCTAGGAAC ACAGTTTGAG AACCACCAAA 1380 

AAAAAA 1386 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2098 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AGCCCTTCTC CCCGCGCTTG GGACTCTGAC ATCTTAAGGC TGCACGGTCG TGTCCTTGTC 60 

TGGGTGAGGC CATGTCTGTG ATCCAAGGTT CCTGGAACTG ACACAGGAAG GC3GCTGTGAA 120 

CCCTAAGTGG GTGTMATCTC CTCCRACCGA GGCTTCTOAC CCTQGAGATG GCAGTTACTC 180 

CTGGCCATGG TTGCTGAGCA TGGGCAGACC AGTGGAGGCC ACCCTACTGT GTTATCTGCG 240 

CCTTCRATGA AGTGAGACCC TTGGGGAGAA CGGGCTGTGG ATGAAGGAGT GGACTGCAGC 300 

CTTGGCCTAG CCACTGGGCT GGGATCTTCT GGGTCATGTG ACTGTGTATC CAGGAGCAGA 360 

AACTTGTATT CTCAGGATTC AGGATCTACC CAGCACCAAA GATGTATTTT CAGGAGAACA 420 

GACCTAGAAA TGGGCCTGTC TGGCATTTCA GAGTCAGGCA AAGCAGGCAG GGCCAGGGAG 480 

CTTCTGTGGG TCTACACAAG AAGGTTCCTG TGAGGGCTAT CAGTTGTTGC CTTCTAGCTT 540 

GCTGGTAACT TTCGCGCCTC CGCCAAGCCC TGCCAGACTC CCCTGGCTGT GATGGCATTC 600 

TGTGCCATCC TGCCTTGTCC CCAGCCTCTG CAGGATGCCC TCCCTACCCA MCTYTYCCTG 660 

GGCCTTCCCT GTCCACTGGG CTGGATTCAT GTTCAAACCA CTGGACTGGC AGGGCAACGA 720 

CTTCaTCCCA CCTCAAGATG AGGTCCTCGC CCCCTTGTCT TGGCATAAAA ACACCTTTAA 780 

AGCATGAGCC ATGTGCTTCT TTGCCCTTCT CTCTCCTGIT CCAATCTTCT GCCTCCCAGT 840 

CACTCCCTGG GGACTATGGG ATCACTGTCC CCCCACCTGT GTGGCCACAC CATGTGTCCT 900 

GTCAATCCAG AACTGCCTCT GAGCTCCAGG CTGACCACAG ATCAGCCACA GCCTGATGCC 960 

TGCAGCCCCA CTTTGCTCAC CCTTCCCCTC CCCTCCTCCT TCCTTCCACA CAGCAAGCCT 1020 

ACCTTTYTCC ATCCATGCTC ACCATAGCCC CCTTCCTTGT GACCTGGACC CTCCATTGTA 1080 

CCTGGCTGAG ACTGTCAGCC TCCTGGAGGA GTGGGGTCCA CCTTCTTCTT GCCCTATGCA 1140 

GTGCAAGCTT CACTTCTCAC CCAGCAAGGT TGACTCATCT GCCTCCATGT CTCTGGGGCT 1200 

TTGCTGTTGC CCTGAAACCT AGCTGGGCTG GTCTTGCTCC CAGCTTGCTT CCCCCTCCTC 1260 

GGATGTCCCT TTGCAGGCCC CTGTCGTTCC TCCGGCACCA GTGTCCTTGG CTGCCATGGC 1320 

AAGCTCATCA GGGGCTTGTA CCCTGGTCAC CAAGCATGGT AGCAGCTGCC TGCATTGTAT 1380 

CTCCATCTGG TCACTGCAGG TGCCAACCCT TCATCCCCCA TGTTTTCCTG QGCCATGGAG 1440 

GGCTGACCTC CGTTTCTGGG GAATGTGGCT GAGCTGTGGT AACCAGCTAC ACCCCAGGTG 1500 

CTCTTTCCAT GGTGGTGCCT GCTCATCTTG CTGATGCAAA CTAGGAAGTT AGGCTGCATC 1560 

TCGGAGTGGC TTTCGCTGGA GAGGTGCTTT GCTGTCTCTC AGACTCAGTC ACTGTGTTCC 1620 

CTCCCCGCCT CTCTTATCTC CATGGCTGTT TGCAGCTCTC CCAGGTACTT TGGGGTCTGA 1680 
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GCTGGAATTC CTTTGTGGTT TGCTCTTCTG CTTCTCACTC TTGTATTAAG AAGGATTCCA 1740 

CAAAGGGAGA GTGGCATCCC TGCTGCTGCT GTGCCAGACC AGAGTTTCCT GAGGGGCCCT 1800 

GACCCTAACC CTCCAGCTCA GCCCTGTACA CCTGACCCTG TAAATGAGTG GGGTTTGCTG 1860 

ACTGTAATCC CTGACACCAG TAAAACCAAA AGGACTCTTG GGGGCTCAGT GTGAGAGCCA 1920 

GGGTTACCTA CTCTGCCAAG TGAGGACAAA CTGCTAGGCT GTATCCCATA ATTTCAGGAT 1980 

GAGAAACATT AACAATAAAA ATTTGTAGTA AACATAACCT CATGANGACT AAAAAAAAAA 2040 

AAAAACTVGG GGGGGGGCCC GTAACCCATT GGGCCCTTNG GGGGGGNGTT TTAAAAOT 2098 

(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
TCGACCCACG CGTCCGAACT GAGGCGGCGG CGGGAGCCGG TTGGKGTCTG GTCTTCGCGT 60 
CGGCCCCGCG GACCAGACGC TGCCCCCGGC GCGGGGAGAA GATQGTGCCK AGCGGCCTCG 120 
GGCCCGCCAC GCGCCGCCAC GAGTGAGCCC AGCGCGACCG CGGGCGTCCG CCGAGCAGCT 180 
GGCCCGGCTG GGCCCGGGGC GCGCANTGCC CGCCGGGGCG GGGTGGAGCT GATCAGAATA 240 

ATGTTCAGCA TCAACCCCCT GGAGAACCTG AAGGTGTACA TCAGCAGTCG GCCTCCCCTG 300 



GTQGTCITCA TGATCAGCGT AANGCCCATG GCCATAGCTT TCCTGACCCT GGGCTACTTC 360 

TTCAAAATCA AGGAGATTAA ATCCCCAGAA ATGGCAGAGG ATTGGAATAC TTTTCTGCTA 420 

CGGTTCAATG ATTTGGACTT GTGTGTATCA GAGAATGAAA CCCTCAAGCA TCTCACAAAC 480 

GACACCACAA CTCCGGAAAG TACAATGACC AGCGGGCAGG CCCGAGCTTC CACCCAGTCC 540 

CCCCAGGCCC TGGAGGACTC GGGCCCGGTG AATATCTCAG TCTCAATCAC CCTAACCCTG 600 

GACCCACTGA AACCCTTCGG AGGGTATTCC CGCAACGTCA CCCATCTGTA CTCAACCATC 660 

TTAGGGCATC AGATTGGACT TTCAGGCAGG GAAGCCCACG AGGAGATAAA CATCACCTTC 720 

ACCCTGCCTA CAGCGTGGAG CTCAGATGAC TGCGCCCTCC ACGGTCACTG TGAGCAGGTG 780 

GTATTCACAG CCTGCATGAC CCTCACGGCC AGCCCTGGGG TGTTCCCCGT CACTGTACAG 840 

CCACCGCACT GTGTTCCTGA CACGTACAGC AACGCCACGC TCTGGTACAA GATCTTCACA 900 

ACTGCCAGAG ATGCCAACAC AAAATACGCC CAAGATTACA ATCCTTTCTG GTGTTATAAG 960 

GGGGCCATTG GAAAAGTCTA TCATGCTTTA AATCCCAAGC ITACAGTGAT TGTTCCAGAT 1020 
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GATGACCXJTT CATTAATAAA TTTGCATCTC ATGCACACCA GTTACTTCCT CTTTGTGATG 1080 

GTGATAACAA TGTTTTGCTA TGCTGTTATC AAGGGCAGAC CTAGCAAATT GCGTCAGAGC 1140 

AATCCTGAAT TTTGTCCCGA GAAGGTGGCT TTGGCTGAAG CCTAATTCCA CAGCTCCTTG 1200 

ITTTTTGAGA GAGACTGAGA GAACCATAAT CXTTTGCCTGC TGAACCCAGC CTQGGCCTGG 1260 

ATGCTCTGTG AATACATTAT CTTGCGATGT TGGGTTATTC CAGCXILAAAGA CATTTCAAGT 1320 

GCCTGTAACT GATTTGTACA TATTTATAAA AATCTATTCA GAAATTGGTC CAATAATGCA 1380 

CGTGCTTTGC CCTGGGTACA GCCAGAGCCC TTCAACCCCA CCTTQGACTT GAGGACCTAC 1440 

CTGATGQGAC GTTTCCACGT GTCTCTAGAG AAGGATTCCT GGATCTAGCT GGTCACGACX5 1500 

ATGTTTTCAC CAAGGTCACA GGAGCATTGC GTCGCTGATG GGGTTGAAGT TTGGTTTQGT 1560 

TCTTGTTTCA GCCCAATATG TAGAGAACAT TTGAAACAGT CTGCACCTTT GATACGGTAT 1620 

TGCATTTCCA AAGCCACCAA TCCATTTrGT GGATTTTATG TGTCTGTGGC TTAATAATCA 1680 

TAGTAACAAC AATAATACCT TTTTCTCCAT TTTGCTTGCA GGAAACATAC CTTAAGiTTl' 1740 

TTTTGTTTTG TTTTrGTTTT TTTGTTTTTT GTTTTCCTTT ATGAAGAAAA AATAAAATAG 1800 

TCACATTTTA ATACTACCAA AAAATGGACA AAAAAAGTCG AGGGGGG 1847 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

GACGCTGACG AGAGAAGGCC TCTTCCTTGA GGGTTGGTGC TGTGTTGCAG TGACCGTGGC 60 

GGATTACGCC AACTCGGATC CGGCGGTCGT GAGGTCTGGA CGAGTCAAGA AAGCCGTAGC 120 

CAACGCTGTT CAGCAGGAAG TAAAATCTCT TTGTGGCTTG GAAGCCTCTC AGGTTCCTGC 180 

AGAGGAAGCT CTTTCTGGGG CTGGTGAGCC CTGTGACATC ATCGACAGCA GTGATGAGAT 240 

GGATGCCCAG GAGGAAAGCA TCCATGAGAG AACTGTCTCC AGAAAAAAGA AAAGCAAGAG 300 

ACACAAAGAA GAACTGGACG GGGCTGGAGG AGAAGAGTAT CCCATGGATA TTTQGCTATT 360 

GCTGGCCTCC TATATCCGTC CTGAGGACAT TGTGAATTTT TCCCTGATTT GTAAGAATGC 420 

CTGGACTGTC ACTTGCACTG CTGCCTTTTG GACCAGGTTG TACCGAAGCA CTACACGCTG 480 

GATGCTTCCC TGCCTTTGCG TCTGCGACCA GAGTCAATGG AGAAGCTGCG CTGTCTCCGG 540 
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GCTTGTGTGA TCCGATCTCT GTACCATATG TATGAGCCAT TTGCTGCTCG AATCTCCAAG 600 

AATCCAGCCA TTCCAGAAAG CACCCCCAGC ACATTAAAGA ATTCCAAATG CTTACTTTTC 660 

TGGTGCAGAA AGATTGTTGG GAACAGACAG GAACCAATGT GGGAATTCAA CTTCAAGTTC 720 

AAAAAACAGT CCCCTAGGTT AAAGAGCAAG TGTACAGGAG GATTGCAGCC TCCCX3TTCAG 780 

TACGAAGATG TTCATACCAA TCCAGACCAG GACTGCTGCC TACTGCAGGT CACCACCXTTC 840 

AATTTCATCT TTATTCCGAT TGTCATGGGA ATGATATTTA CTCTGTTTAC TATCAATGTG 900 

AGCACGGACA TGCGGCATCA TCGAGTGAGA CTGGTGTTCC AAGATTCCCC TGTCCATGGT 960 

GGTCGGAAAC TGCGCAGTGA ACAGGGTGTG CAAGTCATCC TGGACCCAGT GCACAGCGTT 1020 

CQGCTCTTTG ACTGGTGGCA TCCnCAGTAC CCATTCTCCC TGAGAGCGTA GTTACTGCTT 1080 

CCCATCCCTT GGGQGCAGCC TCGAGTGTAG TCCATTAGTA ATCAGATTCC AGTTTGGACA 1140 

GGGTGGCTGG ATTGTATATC TCGTTAGTAA TGTACATGCT CTTCAGGTTC TAC3GGCTCCT 1200 

GTTAGGGGAG GGAGAAATGT TGAATCAAGA GGGAAAACAA CTACTATGAT TTATAAACAT 1260 

ATTTTAATGT AAAAATTTGC ATTTAAAAGG AGTGGCCCTG TTTTCTGTGT TAAAACCCCA 1320 

TTTGGTGCTA TTGAGTTTGT TCTTTATTCT TTTATCCCAG TGAAAATTGT TGATCTTGCT 1380 

GTAGGGAAAA ATTAAACTCT TTGAATCTCC AAACAAGGAA GTTTCAGCAT TCCCTTATGG 1440 

ATCAGAGGAA CCTTAGAGGC CTGAAATTGT TGCTTCCAGT TTAGCTGCCC CTCAAATTCA 1500 

AGTGAATATT TTCCCTTCTC CCTTTACCCT TCTCCAGAAA TAAAGCAGGT GACAGGGTTT 1560 

CAGAATCTT 1569 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



CCCACGCGTC CGGAAGGATT GACCAGTTAA CCAACATCTT AGCCCCCATG GCTGTTGGCC 60 

AGATTATGAC ATTTGGCTCC CCAGTCATCG GCTGTGGCTT TATTTCGGGA TGGAACTTGG 120 

TATCCATGTG CGTGGAGTAC GTCCTGCTCT GGAAGGTTTA CCAGAAAACC CCAGCTCTAG 180 

CTGTGAAAGC TGGTCTTAAA GAAGAGGAAA CTGAATTGAA ACAGCTGAAT TTACACAAAG 240 

ATACTGAGCC AAAACCCCTG GAGGGAACTC ATCTAATGGG TGTGAAAGAC TCTAACATCC 300 

ATGAGCTTGA ACATGAGCAA GAGCCTACTT GTGCCTCCCA GATGGCTGAG CCCTTCCGTA 360 
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CCTTCCGAGA TGGATGGGTC TCCTACTACA ACCAGCCTGT GnTCTGGCT GGCATGGGTC 420 

TTGCrrrCCT TTATATGACT GTCCTGQGCT TTGACTGCAT CACCACAGGG TACGCCTACA 480 

CICAGGGACT GAGTGGGTTC CATCCTCAGT ATTTTGATGG GAC3CATCAGC TATAACTGGA 540 

ATAATGGGAA CTGTAGCTTT TACTTGGCTA CGTCGAAAAT GTGGTTTGGT TCGGCAGGTC 600 

TGATCTCAGG ATTGGCACAG CTTTCCTGTT TGATCTTGTG TGTGATCTCT GTATTCATGC 660 

CTGGAAGCCC CCTGGACTTG TCCGTTTCTC CTTTTGAAGA TATCCGATCA AGGTTCATTC 720 

AAC3GAGAGTC AATTACACCT ACCAAGATAC CTGAAATTAC AACTGAAATA TACATGTCTA 780 

ATGGGTCTAA TTCTGCTAAT ATTGTCCCGG AGACAAGTCC TGAATCTGTC CCCATAATCT 840 

CTGTCAGTCT GCTGTTTGCA GGCGTCATTG CTGCTAGAAT CGGTCTTTGG TCCTTTGATT 900 

TAACTGTGAC ACAGTTGCTG CAAGAAAATG TAATTGAATC TGAAAGAGGC ATTATAAATG 960 

GTGTACAGAA CTCCATGAAC TATCTTCTTG ATCTTCTGCA TTTCATCATG GTCATCCTGG 1020 

CTCCAAATCC TGAAGCTTTT GGCTTGCTCG TATTGATTTC AGTCTCCTTT GTGGCAATGG 1080 

GCCACATTAT GTATTTCCGA TTTGCCCAAA ATACTCTGGG AAACAAGCTC TTTGCTTGCG 1140 

GTCCTGATGC AAAAGAAGTT AGGAAGGAAA ATCAAGCAAA TACATCTGTT GTTTGAGACA 1200 

GTTTAACTGT TGCTATCCTG TTACTAGATT ATATAGAGCA CATGTGCTTA TTTTGTACTG 1260 

CAGAATTCCA ATAAATGGCT GGGTGTTTTG CTCTGTTTTT ACCACAGCTG TGCCTTGAGA 1320 

ACTAAAAGCT GTTTAGGAAA CCTAAGTCAG CAGAAATTAA CTGGATTAAT TTCCCTTATG 1380 

TTGAGGGCCA TGGRAAAAAA ATIK3GGAAAA GGAAA^^ AGTTmAAM_ ACGGGA^ 1440 

ATAATGGATA ACACTGRATT CCXTCTATTTC TCATGAGTAG ATACAATCTT ACGTAAAAGA 1500 

GTGGTTAGTC ACGTGAATTC AGTTATCATT TGACAGATTC 1540 



(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1719 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
TACTTATGAG GTCAATTGGA AATAAGAACA CCATTTTACT GGGTCTAGGA TTTCAAATAT 60 
TACAGTTGGC ATGGTATGGC TTTGGTTCAG AACCTTGGAT GATGTGGGCT GCTGGGGCAG 120 
TAGCAGCCAT GTCTAGCATC ACCTTTCCTG CTGTCAGTGC ACTTGriTCA CGAACTGCTG 180 

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ATGCTGATCA ACAGGGTGTC GTTCAAGGAA TGATAACAGG AATTCGAGGA TTATGCAATG 240 

GTCTGGGACC GGCCCTCTAT GGATTCATTT TCTACATATT CXIATGTGGAA CTTAAAGAAC 300 

TGCCAATAAC AGGAACAGAC TTGGGAACAA ACACAAGCCC TCAGCACCAC TTTGAACAGA 360 

ATTCCATCAT CCCTGGCCCT CCCTTCCTAT TTGGAGCCTG TTCAGTACTG CK5GCTCTGC 420 

TTGTTGCCTT GTTTATTCCG GAACATACCA ATTTAAGCTT AAGGTCCAGC AGTTGGAGAA 480 

AGCACTGTGG CAGTCACAGC CATCCTCATA ATACACAAGC GCCAGGAGAG GCCAAAGAAC 540 

CTTTACTCCA GGACACAAAT GTGTGACGAC TGAAATCAGG AAGATTTTTC TATCAGCACC 600 

CAGGTCTTAG TTTTGACXTTC TAGTTCTGGA TGTACATTCC ATTTCCATCC ACAGTGTACT 660 

TTAAGATTGT CTTAAGAAAT GTATCTGCAT GAACTCCX3TG GGAACTAAAG GAAGTGGGAA 720 

CTTAGAACCA GACAGTTTTC CAAAGATGTT ACAATTTCTT TTGAAAAACC TTTTGTTTAT 780 

TAGCACCAAT TTCTYGCCAC TAAGCTATTT GTTTTATTAT ACATCCTTTA ATTAAAAACT 840 

ATATATGTAA CTTCTTAGAT ATTAGCAAAT GTCTCTGCTA CCATTTCCTT AAGGTGrrGA 900 

GCTTTAACTC TATGCTGACT CAGTGAGACA CAGTAGGTAG TATGGTTGTG GACCTATTTG 960 

TTTTAACATT GTAAAATTTT GAGTCAGATT TTAATATTGT AAAATCTTGG GTCAAATAAT 1020 

TCAAAGCCTT AATGCAGATG CACTAAAACA AAGAAATGGT AAATGAATTG TTTGCATTTA 1080 

AAAAAAAAAA CTCTTAAGAA AACTGTACTA AATCTGAATC ATGTTTTGAG CTTGTTTGCA 1140 

GTACTTTTT^ ACATTATTCA CTACTGTTTT TGAAGTGAGA AAGTATCAGC CATTTAGCAT 1200 

TTAAGTTGGG GTATTTAGAG CCTGTAATCT AAATGCTGGC TCAAATTTAT TCCCCAGCTA 1260 

CTTCTEATAC CACrATTCTT TTAATGTTTG CATAATGATA AGGAGGTCAA XZACTTGAATA 1320 

CATAATCTAA AAA1TATATA GTAAAGCTGG TAGCCTTGAA AATGTCAGTG TGATATCTAT 1380 

TATGTAGATA AATATATATA GTGGCCTTTC AGGACTGTCA CAGTAACACT TTATTTACAG 1440 

AGCTAATGTT TGTCCTAAAT TTTCAGGACC CTAGAGGAGA GCTTTATACA ATTACX:GATC 1500 

TGAATTTCTC TAAAGTGTAT ATTTTTGTGT CXAGTTATAT TATTTAAAAA AGTGTTACTT 1560 

TGTAAAAATT GTATATAAAG AACTGTATAG TTTACACTGT TTTCATCTTG TGTGTGGTTA 1620 

TTGCTTAATG CTTTTTAAAC TTGGAACACT CACTATGGTT AAATAAGGTC TTAAAAGAAA 1680 

TGTAAATATT YTGTTAATAA AGTTAAATAT TTTAATGAT 1719 



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(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 863 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

GGCACGAGGG AAGCCGGGAC GATGTCCGCA TGACAACCGA CGTTGGAGTT TGGAGGTGCT 60 

TGCCTTAGAG CAAGGGAAAC AGCTCTCATT CAAAGGAACT AGAAGCCTCT CCCTCAGTGG 120 

TAGGGAGACA GCCAGGAGCG GTTTTCTGGG AACTGTGGGA TGTGCCCTTG GGGGCCCGAG 180 

AAAACAGAAG GAAGATGCTC CAGACCAGTA ACTACAGCCT GGTGCTCTCT CTGCAGTTOC 240 

TGCTGCTGTC CTATGACCTC TTTGTCAATT CCTTCTCAGA ACTGCTCCAA AAGACTCCTG 300 

TCATCCAGCT TGTGCTCTTC ATCATCCAGG ATATTGCAGT CCTCTTCAAC ATCATCATCA 360 

TTTTCCTCAT GTTCTTCAAC ACCTTCGTCT TCCAGGCTGG CCTQGTCAAC CTCCTATTCC 420 

ATAAGTTCAA AGGGACCATC ATCCTGACAG CTGTGTACTT TGCCCTCAGC ATCTCCCTTC 480

ATGTCTGGGT CATGAACTTA CGCTGGAAAA ACTCCAACAG CTTCATATGG ACAGATGGAC 540 

TTCAAATGCT GTTTGTATTC CAGAGACTAG CAGCAGTGTT GTACTGCTAC TTCTATAAAC 600 


GGACAGCCGT AAGACTAGGC GATCCTCACT TCTACCAGGA CTCTTTGTGG CTGCGCAAGG 660 

AGTTCATGCA AGTTCGAAGG TGACCTCTTG TCACACTGAT GGATACTTTT CCTTCCTGGA 720 

TAGRAGGCCA CATTTGCTGC TTTGCAGGGG AGAGTTGGGC CCTATGCATG GGGCAAAACA 780

GGTGGGATTT TCCAAGGGAA GGGTTCAGAA TTAGGCNTGT TGTTTCAGCC ATTTCCAAGG 840 

AAC3GGGAAGG GTTTCCCTNC CCT 863 

(2) INFORMATION FOR SEQ ID NO: 154:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AACAGCAAAA AAGAATGATT TCTTCTGAAA TTGTGGAACA TGAGGATTCA AGTTTTTATT 60 

TTGTTACTAG GTGCTGGAGG AACATCCCAG TTCACAAAGC CCCCATCTCT TCCTCTGGAG 120 

CCAGAGCCTG CGGTGGAATC AAGTCCAACT GAAACATCAG AACAAATAAG AGAGAAATAA 180 

GAATAQAATG AATGACCCCA AAATARGGTT TTCTTGGGCG AGGATGTGCT GGATTAGGAA 240

AGGTGACATG ACACAGGCAG AGCAGAGTGG CAC-CCACCAC AGAATACAGT GTGTGTTATT 300 

ACGAGGAGCC AGCAGTTGAG CCTAAGGTCC TTCTACCTAC CTGGTATTGG CATTTGAGGT 360 

CGGAAACXrCT CTACTGCCCC ATAAGCCAGG AAAAGTGAAA AGAGAACACA GTTCCTTTAA 420

GAACTGGCAG CAAGGCTTGA GGCCTTATGT ATGTAGCTGA GTCAGCAAGG TACATGATGC 480

TGTCTGCTTT CAAAAGGACT TTTCTCTCCT AGCTGACTGA CTCCTTCCTT AGTTCAAGGA 540

ACAGCTGAGA CAGACOTCTG CTGAGTAGCT CTGTGATGAC AAAGCXTTTGC TTTAACTGAG 600

GTGATCCTCA GGTTGTGAGG TTTATTAGTC CCCAAGGCAA ACACAAATAT TAGATTAATA 660

ATCCAACTTT AATAGTATAC ATTTAAAAGA AAAAAAACAA AAGCCCTGGA AGNTTGAGGC 720

CAAGCCTGCT GAGTATTGCA GCTGCATTTG CCCAAAGGGA ATCCAGAACA AGTCCCTCCC 780

TGTATTTTGT TCTTGAGAGG GGTCAGTCTA GAAGCTAGAT CCTATCAGGA TGAGGAGCAG 840

CAGCCCAGGG CrrTGTCTGGA TCAGCACCAA CGATTTTAAA GAAAAAAGGA AGAGTTTCTT 900

AGATGAGTAA TTGTTATTGA AGATAGTCAG TGATAACCAC TGACCAGATG CTATCAATAC 960

ACTATGTGTC CTTTTTAGAA TAAAGATTAC ATATCATCAT TCCTTrGGGG AAAATTGTTA 1020

TTCAGGTATA AAAACAAGAG ATTATAATAA AAAANTAAAA GAACCCTAAA AAAAAAAAAC 1080

CTCGTGCCGA AITCCCTGCA G 1101



(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



CAATTAACCC GTTTGAGGCC TAGGTTGTTT GGCAAGCCCC NGGCCTAAAG TTTTAATTCG 60

GCAGAGCCAA GGGCCTGAAA GGAAGGGAAA GGGGAGQGTA GCGGGAGGGT AGCAGGTGAG 120 

TTCCTAGGGC TGGAAGGTTT AGCAGCAGCC TGGTGCAGTG CCCTGTCATC AAGACAAACC 180 


CACGGTCCTC CTGGGTGCCT ACCAAGCTTG GTTTGTACAA AAGCAAGGTG GGAGTCTATT 240 

TTTGTACATG AGATACATCA CACTTACCTG TGGGCCAGTA TTGTGAAGTG AGTCTGAGTT 300 

GTTTACACTG ATGCCTTCCC TGCCCACCAC AAATTGTGTA CATAGTCTTC AGAATGATAC 360

CACCCCTTTC CCCAGCTCCC AACCAAGAGC TGGTTCTAGG CCTGTGTTAT ATGTCATATT 420 

TAGCGTTTTT ATATATGACC TTTGATTTCT GTTGTTTGTA TTTTAGCACA GTGTATGCAC 480 


CTTCATTTAA ATACATCTGT GTGCATACAG ATACGCATAT ATGTGTGTGC GTATGCATAT 540 

ATCTCTCATC TGTAGTTTCC AAGAGTTCAG CTGAAGCAGA TGGAGTCCTG CAGCCCAGGA 600 

GACACCCTGC ATCCCTGCTA ATAGTGTTTG CCACAAGTAT TAGTGAGTCT TCCTTATTAA 660

TATTTTCATT TCAGAAGACT GAAGCAAAGC TGATAGTGTT TGCTGTTTCT TTGGCAGCTA 720

AGTGAGGGTC TTCGGATGAC TTGCTGTGTT CCTCAAGCTG CACTTTGGGG CXirATCTCTGC 780 


AGTATTAAGC CCCCTTTTTG CTTGGTGGTA CTCTGTCTGT GCCTGTGTGT GTGTGTGATA 840 



GTCACTCTTG CATGGCTTCC ATGTCTGGTT TGTGGCATTT GGGGATAAGT GCTGAACCAG 900 

AGCATTTGCA GTTTGnTGA GGCCTCGTTG CCAATGATAG ATCACTCCTG TTGACCTGGT 960

ATGTCTGCTT GCTTGCTGCT TTTCCTTGCT ITCTCTTGGA AGAC3GAAAGG ACTCTGGTCA 1020 

GGCCCAGGCT GAGTGAGATG AGCTGCAGCT GGCTCATGGC CTTCTTAGAG CAGAGAGAGG 1080 


AGTATGTCAT TTTACTAAGT TCCTAAACAA ACATTTATGC AGGCAACACT CCTTGCAGAT 1140 

CTAGAAACTG AGGCACAATA GGGTTATGAC TTGCTCAAGA ATATGTAGCT GCTAGGGGGT 1200 

AAATCAAGGC ATCACAATTT CTGTTCAGCG GGCAGGAATA GGCTGTGAAT TCCTAGCACT 1260

TTTTTTTTAA GCAATTACTT TTTGACTTGT TCCTCTGAAA GTGCAAGAGG CGTACACXnT 1320 

TCCCAAATCT AGACTAGAAT CTGCAGGATG CCACCCACTG TATAGTTCTG CiU'iCCXAGA 1380 


GAGGAAGAAC TTTTAGAAAC CAAATGATCT TAATTGTTAT TGCCCACCCC TGGCTTTTCC 1440 

GGGTAGAAAA TTCACAGTAG GAATGATTGT TAAGAGAGAG TC5CTTGGAAC CATGGGTTAA 1500 

CAGGAAAGGC TACCTAACTT CACATATCTG CAACCAGAGC AGCCACCAAG CATTACTTAG 1560

CAGCAGGAAA AIGATTGTAT TTGAGTTCCT GTGTGTCCAA AACTGAGGCA CCATGTTCTT 1620 

TGAAAACATG CCACCTCAAG GCTGGGCGCG GTGGCTCACA CCTGTTAATC CCAGCACTTT 1680 


GGGAGGCCGA GGCGGGCGGA TCACCGGAGT CGGGGAGTTT GAGACCAGCC TQGACCAACA 1740 



TGGGAGAAAC CCCATCTCTA CCTAAAAATA CAAAATTAGC CGGGCGTGGT GGCATGCGCC 1800 

TATAATCTCA GCTACTTGGG AGGGYTGAGG CAGGRGAATT GCTTGAACCX: RGGANGGCGG 1860

AGGTTTGCX5G TTGAGTTGAG GATCGTGCCA TTGCACTTCC GGGCCTTGGG GCAACAACAG 1920 

CAAAAAYTCC GTCTTCAAMW MimSCCGAAT TCGATATCAA GCTTATCGAT ACCXSTCGACC 1980 


TCGAGGGGGG GCCCGGTACC CAATTCGCCC TATAOTCATC GTATTACAAT C 2031 






(2) INFORMATION FOR SEQ ID NO: 156: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

CCTGCACCCT GAGCCCTTCA CCCCTCCGAG TTCCCCCCAG GTTGGCTTCC TTCGATTCCT 60

TTTCTTGGTA TCAACGTTTG ATTOGAAGAA CAACCCCXrTC TTTGTCAACC TCAATAATGA 120 

GCTCACTGTG GAGGAGCAGC TCGGGCACAG CTCMCCGTYA OXSGTCATTGT TACCCCCTAA 180

GACCGCAAAA ACTCTGTGTG GACACAGGAT GGACCCTCAG CXTCAGATCCT GCAGCAGCTT 240 

GTGGTCCTGG CAGCTGAAGC CCTGCCCATG TTAGAGAAGC AGCTCATGGA TCCCXTOGGGA 300 


CCTGGGGACA TCAGGACAGT GTTCCGGCCG CCCTTGGACA TTTACGACX3T GCTGATTCGC 360 

CTGTYTCCTC GCCATATCCC GCGGCACCGC AGGCTTGTGG ACTCGCCAGV TGCCTCCTTC 420 

TGCCGGGGCC TGCTCAGCCA GCCGGGGCCX: TCATCCCTGA TGCCXXSTGCT GGGTNATGAT 480

CCTNCTCAGC TCTATCTGAC GCAGCTCAGG GAGGCCTTTG GGGATCTGGC CCTTTTCTTC 540 

TATGACCAGC ATGGTGGAGA GGTGATTGGT GTCCTCTGGA AGCCCACCAG CTTCCAGCCG 600 


GAGCCCTTCA AGGCCTCCAG CACAAAGGGG CGCATGGTGA TGTCTCGAGG TGGGGAGCTA 660 

GTAATGGTGC CCAATGTTGA AGCAATCCTG GAGGACTTTG CTGTGCTGGG TGAAGGCCTG 720 

GTGCAGACTG TGGAGGCCCG AAGTGAGAGG TGGACTGTGT GATCCCAGCT CTGGAGCAAG 780

CTGTAGACGG ACAGCAGGAC AITGGACCTC TAGAGCAAGA TGTCAGTAGG ATGACCTCCA 840 

CCCTCCTTGG ACATGAATCC TCCATGGAGG GCCTGCTGGC TGAACATGCT GAATCATCTC 900 


CAACAAAACC CAGCCCCAAC TTTCTCTCTG ATGCTCCAGC ATTGGGGCAG GGGCATGGTG 960 

GCCCATGTAG TCTCCTGGGC CTCACCATCC CAGAAGAGGA GTGGGAGCCA GCTCAGAGAA 1020 

GGAACTGAAC CCAGGAGATC CATCCACCTA TTAGCCCTGG GCCTGGACCT CCCTGCGATT 1080

'T^CCACTCCT"TTCTTAGTCT~TCTTCC^^ 1140" 

CTCTGTCTCC TTCCTGCTGC CAGGACCTGT GCCTAGACTT AGCATGCCCT TCACTGCAGT 1200 


GTCAGGCCTT TAGATGGGAC CCAGCGAAAA TGTGGCCCTT CTGAGTCACA TCACCGACAC 1260 

TGAGCAGTGG AAAGGGGCTA TATGTGTATG AATAGACCAC ATTGAAGGAG CACAATGCCC 1320 

TCCTGTGTTG ATGCCACTTC CCAGGGTGGA GACAGTGGAA AAGAACCGAG GACAGGAAAG 1380

GATTGGGTAG GTGAAGGGGT CAGGGGACTG GTAGTCACCC AATCTTGGAG AGGTGCAAAA 1440 

AGCACTGGGG GCTACCCGTT AGCTGCATCT GCCCTGGCTG ITTGCCCGTT CATGTrCACAA 1500 


ACTGCCACTA CTATGTACCT GCAGTGGGGT TGCAGAGATG GGGGAGACTC AAGTCTTACT 1560 

CCCCAGGAGC TCCCAGGGCC CAAGGAGGAG AATGCTGCCT CCTTTCAGTC TGGTCTACAC 1620 

CCACTTTCTG GTAGCCTCTC TGCTTCCTGT AATTCTGGCT GTTTTTCCAG ACTCAGCTCA 1680

AATAGTGCCC CTCCTTAAGC CCATCCCTCG CCCCCAGCCT GAGGTGATCT TTCCCTCCTC 1740 

TGAACTATTA GAGCAGTTAC TGTCTGTTCA GTTCGTTTGG CAGGCACACA CAGTGGCATA 1800 

AATTCTATTG TTTTGAACTC TGATTTAAAA TTAAATTGCA GCTGGGCGTG GTGQCTCATG 1860

CTTGTAATCC CAACACTTAG GGAGTMAGGR GAATCACTTG ASCYCAGGAG TYCTAGACCA 1920

ATCTGGGCAA MAGAGAGACC CX:ATCTCTTT TAAATAAAAA GTTAAATTGC TTAAAAAAAA 1980

A 1981

(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 915 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

GAATTCQGCA CGAGCGCGGC CATGGCGCTC CTGCTTTCGG TGCTGCGTGT ACTGCTGGGC 60

GGCTTCTTCG CGCTCGTGGG GTTQGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTCG 120

GAGCGGATGA ATGCCCTGTT CGTGCAGTTT GCTGAGGTGT TCCCGCTGAA GGTATTTGGC 180

TACCAGCCAG ATCCCCTGAA CTACCAAATA GCTGTGGGCT TTCTGGAACT GCTGGCTQQG 240

TTGCTGCTGG TCATGGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTGATTCTG 300

CTCATGATGG GGGCTATCTT CACCTTGGCA GCTCTGAAAG AGTCACTAAG CACCTGTATC 360

CCAGCCATTG TCTGCCTGGG GTTCCTGCTG CTGCTGAATG TCGGCCAGCT CTTAGCCCAG 420

ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTGG 480

AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540

ACAGAGTCTA TCATCTTGTT ACCAGTATAA TATCCAGGGT CAGCCAGTGT TGAAAGAGAC 600

ATTTTGTCTA CCTGGCACTG CnTCTCTTT TTAGCTTTAC TACTCTTTTG TGAGGAGTAC 660

ATGTTATGCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAGC AGAAAAGAAA 720

TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780

TACCTCTGAA CTTTTTTCTG TGCCTTTAAA CAGATATATA TTTTTTTTWA ATGAAAATAA 840

AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 900

AAAAAAAAAA CTCGA 915






(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2117 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

AGAGCGAAGC GAGGGTGGCG CGGGTCOGGG CATGAAGCTG GGCCGGGCCG TGCTGGGCCT 60 

GCTGCTGCTG GCGCCGTCCG TGGTGCAGGC GGTGGAGCCC ATCAGCCTGG GACTCGCCCT 120 

GGCCGGCGTC CTCACCGGCT ACATCTACCC GCGTCTCTAC TGCCTCTTCG CCGAGTGCTG 180 

CGGGCAGAAG CGGAGCCTTA GCCGGGAGGC ACTGCAGAAG GATCTGGACG ACAACCTCTT 240 

TGGACAGCAT CTTGCAAAGA AAATCATCTT AAATGCCGTG TTTGGTTTCA TAAACAACCC 300 

AAAGCCCAAG AAACCTCTCA CGCTCTCCCT GCACGGGTGG ACAGGCACCG GCAAAAATTT 360 

CGTCAGCAAG ATCATCGCAG AGAATATTTA CGAQGGTGGT CTGAACAGTG ACTATGTCCA 420 

CCTGTTTGTG GCCACATTGC ACTTTCCACA TGCTTCAAAC ATCACCTTGT ACAAGGATCA 480 

GTTACAGTTG TGGATTCGAG GCAACGTGAG TGCCTGTGCG AGGTCCATCT TCATATTTGA 540 

TGAAATGGAT AAGATGCATG CAGGCCTCAT AGATGCCATC AAGCCTTTCC TCGACTATTA 600 

TGACCTGGTG GATGGGGTCT CCTACCAGAA AGCCATGTTC ATATTTCTCA GCAATGCTGG 660 

AGCAGAAAGG ATCACAGATG TGGCTTTGGA TTTCTGGAGG AGTGGAAAGC AGAGGGAAGA 720 

CATCAAGCTC AAAGACATTG AACACGCGTT GTCTGTGTCG GTTTTCAATA ACAAGAACAG 780 

TGGCITCTGG CACAGCAGCT TAATTGACCG GAACCTCATT GATTATTTTG TTCCCTTCCT 840 

CCCCCTGGAA TACAAACACC TAAAAATGTG TATCCGAGTG GAAATGCAGT CCCGAGGCTA 900 

TGAAATTGAT GAAGACATTG TAAGCAGKGT tSGCTGAGGAG TGACATTIT TCCCCAAAGA 960

GGAGAGAGTT TTCTCAGATA AAGGCTGCAA AACGGTGTTC ACCAAGTTAG ATTATTACTA 1020 

CGATGATTGA CAGTCATGAT TQGCAGCCGG AGTCACTGCC TGGAGTTGGA AAAGAAACAA 1080 

CACTCAGTCC TTCCACACTT CCACCCCCAG CTCCTTTCCC TGGAAGAGGA ATCCAGTGAA 1140 

TGTTCCTGTT TGATGTGACA GGAATTCTCC CTGGCATTGT TTCCACCCCC TGGTGCCTGC 1200 

AGGCCACCCA GGGACCACGG GOGAGGACGT GAAGCCTCCC GAACACGCAC AGAAGGAAGG 1260 

AGCCAGCTCC CAGCCCACTC ATCGCAGGGC TCATGATTTT TTACAAATTA TGTTTTAATT 1320 

CCAAGTGTTT CTGTTTCAAG GAAGGATGAA TAAGTTTTAT TGAAAATGTG GTAACTTTAT 1380 

TTAAAATGAT TTTTAACATT ATGAGAGACT GCTCAGATTC TAAGTTGTTG GCCTTGTGTG 1440 

TGTGTTTTTT TTTAAGTTCT CATCATTATT ACATAGACTG TGATGTATCT TTACTGGAAA 1500 

TGAGCCCAAG CACACATGCA TGGCATTTGT TCCACAGGAG GGCATCCCTG GGGATGTGGC 1560 

TGGAGCATGA GCCAGCTCTG TCCCAGGATG GTCCCAGCGG ATGCTGCCAG GGGCAKTGAA 1620 

GTGTTTAGGT GAAGGACAAG TAGGTAAGAG GACGCxrrrcA GGCACCACAG ATAAGCCTGA 1680

AACAGCCTCT CCAAGGC3TTT TCACXTTTAGC AACAATGGGA GCTGTGGGAG TGATTTTGGC 1740

CACACTGTCA ACATTTGTTA GAACCAGTCT TTTGAAAGAA AAGTATTTCC AACTTGTCAC 1800

TTGCCAGTCA CTCCGTTTTG CAAAAGGTGG ccxnrcACTG TCCATTCCAA ATAGCCCACA 1860

CGTGCTCTCT GCTGGATTCT AAATTATGTG AATTTTGCCA TATTAAATCT TCCTCATTTA 1920

TACTATTATT TGTTACGTTC AATCAGAATC CCCGAAACCT CCTATAAAGC TTAGCTGCCC 1980

CTTCTGAGGA TGCTGAGAAC GGTGTcrrrc TTTATAAATG CAAATGGCTA CCGTTTTACA 2040

ATAAAATTTT GCATGTGCAA AAAAAAAAAA ANAAAAAAAA AAAATCCCGG GGGGGGGCCG 2100

GTAACCAATT TCNCCCC 2117

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2395 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

TGTTCCTTAA TCCCTTrTCT AAAAAGGGGG GAAAATCCGG ATGGATTTTA GGGATTGGTC 60

TGGTGTCAGC TGTGTTTTAT TGCACACCTA AATCCTGATT ATAGGCTTTT CATTTCTCCG 120

CAAAGCCTTT ATTTTGGCAG.JrrAAGCCAAA_TGTGTTTTCC_AGAAAC^ 180

CTCTTTCTTT CCTTTCTTTC CTCCCTTTTT CCCGTCTGAC CCCAAACGTT ATTGTCCAAA 240

CATGACTGGA CAGCAGCTTT TGTTTCTTGA CCCTGTAATA TGACAGTCTG CTAATATTGA 300

CAGAAGGTGC AGTTTTTGGG TTATAGTCGT GATTTTCGCT AATCAATCAT ATTAGCAGGA 360

AAAAAT^GA CTTGTTTCTG TTGTACTTGA CTCTTAAGAA AAAGTTGGCCC ATAGTTTAGT 420

GGACAATTTC CAAAGGCTTT AGTACCACCT GTATTTCAAA ATGGGGGACC CAAACTCCCG 480

GAAGAAACAA GCTCTGAACA GACTACGTGC TCAGCTTAGA AAGAAAAAAG AATCTCTAGC 540

TGACCAGTTT GACTTCAAGA TGTATATTGC CTTTGTATTC AAGGAGAAGA AGAAAAAGTC 600

AGCACTTTTT GAAGTGTCTG AGGTTATACC AGTCATGACA AATAATTATG AAGAAAATAT 660

CCTGAAAGGT GTGCGAGATT CCAGCTATTC CTTGGAAAGT TCCCTAGAGC TTTTACAGAA 720

GGATGTGGTA CAGCTCCATG CTCCTCGATA TCAGTCTATG AGAAGGGATG TAATTGGCTG 780

TACTCAGGAG ATGGATTTCA TTCTTTGGCC TCGGAATGAT ATTGAAAAAA TCGTCTGTCT 840

CCTGTTTTCT AGGTGGAAAG AATCTGATGA GCCTTTTAGG CCTGTTCAGG CAAATTTGAG 900

TTTCATCATX3 GTGACTATGA AAAACAGTTT CTGCATGTAC TGAGCCGCAA GGACAAGACT 960

GGAATCGTTG TCAACAATCC TAACXIAGTCA GTGTTTCTCT TCATTGACAG ACAGCACTTG 1020 

CAGACTCCAA AAAACAAAGC TACAATCTTC AAGTTATGCA GCATCTGCCT CTACCTGCCA 1080 

CAGGAACAGC TCACCCACTG GGGCAGTTGG CACCATAGAG GRTCACCTCC GTCCTTATAT 1140 

GCCAGAGTAG AGTACTGACC AGCAAAATGG AGAAGATCAG AGAATGCAGC AGCAGTTTTT 1200 

TTrCTTGTTT TCTTACCACT TTATTCTTTC AGAGTTTAAA GAAAATGGAC TCATGCACAG 1260 

AACACTATGC ATTTTGAAAC TTGTTCATCC TGGATTTTTT TAAATCATTT TTATCTCAGA 1320 

ACTTAAACAA AAATTAGATG TCGTGCACGG ACTGTGTGAA AGAAGATGCT TTGCATATTT 1380 

GCTGCACTGC ATCAGTATCT TACTAAAAAT GTGAAATGAA AGGACTATTG TACACTGAAA 1440 

TGCTTAAATG TATCTGAAAG CACAAGGTGA TACTCATTTT TATGGTCTTC CCATTTGTGC 1500 

TGGTTTTTGC CTCTTTGACA TCTGTCATCA GTATTTAGAG GGTGAGAAGT GAATGTAACA 1560 

GGTATAAATA ACATTTTTAA AAACAATAAC TTTGCTATAA TCACAGTrGT TCCAGAGCAC 1620 

TGTCAGATAC ATTCTAATGA CCAGAACTGG TTTAAAAAAA GAAAATACAA CXyvTCQGAAA 1680 

GAAATCTTAA ATGAAAAACG CATCTCATTG TAGGCATTTT TGCCTCATAT TTTACTGGGC 1740 

CATX3TTTGTT TCCTGGTACT CATGTATTTT TTTTTTCCAG ATCTCTTTCC CCAAGTTGCT 1800 

ATTGTAAGAG TATTCTGCTG CX3TGTGGATG CAGTTATACA CATTAAAGCA GATCTGGAGT 1860 

CTGAAGTAGC TATAAAGCAG CTATAAAACA GAAATACATG CATAGCTGCA GAAACCATGA 1920 

TAGGTAGAGG ACTTTTCTTT TGGTTTTGTT TTGTTTTGTT TTGTTTTCTT TTTGGTTTTA 1980 

CAGAGAAGAG ATTTTTATTA CAAAGAAAAA AATTCCAGTG AATTGTGCAG AAATGCTGGT 2040 

TTTTACACCA TCCTAAAGAA AAACTTTACA AGGGTGTTTT GGAGTAGAAA AAAGGTTATA 2100 

AAGTTGGAAT CTTAAATTGr AAAATTAACC ATTGAGTGTC AAAGTTCTAA AAGCAGAACT 2160 

CATTITGTGC AATGAACATA AGGAAAGACT ACTGTATAGG T l " m ''l'i'l''ri' TTCTCCTTTT 2220 

AAATGAAGAA AAGCTTTGCT TAAQGGTTGC ATACTTTTAT TGGAGTAAAT CTGAATGATC 2280 

CTACTCCTTT GGAGTAAAAC TAGTGCTTAC CAGTTTCCAA irGTATTTAG CTTCTGGTTG 2340 

GAATTTGAAA AAAAAAGAAA AAAAGAAAAA GAAAACCTAA ATAAAATAQG TCAAA 2395 



(2) INFORMATION FOR SEQ ID NO: 160:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

CCCCGGATAC CGCCTOACGT AGTGCCAATC ACACCTCTCG CGTCTCGGCG CCTCGGAGGC 60

TAATGAGGAC GCCTGGCGAA ACGCAGTAAC GGATTTCCGG GTGGACCTTC GCTTTACGGC 120 

TCGTGAGTTC TTCCGCCCAA CCCAGAGGAA GCGGGAGAGC AGTTTAOGAC AGCGCCGGTC 180 


GTGTTTACGG CGGCGCCCGC TGCGCGCGCA TGTTTCCTCT TTTCCTQGTT TCTCAAGAGT 240 

GCTGCTGCTA ACGCGGTCCC CGGCACGCAC CATCTGTTGC CATCCCGGCC GGCCGAGGCA 300 

TTGCAGATTT TGGAAGATGG CAAAGTTCAT GACACCCGTG ATCCAGGACA ACCCCTCAGG 360

CTGGGGTCCC TGTGCGGTTC CCGAGCAGTT TCGGGATATG CCCTACCAGC CGTTCAGCAA 420 

AGGAGATCGG CTAGGAAAGG TTGCAGACTG GACAGGAGCC ACATACCAAG ATAAGAGGTA 480 


CACAAATAAG TACTCCTCTC AGTTTGGTGG TGGAAGTCAA TATGCTTATT TCCATGAGGA 540 

GGATGAAAGT AGCTTCCAGC TGGTGGATAC AGCGCGCACA CAGAAGACGG CCTACCAGCG 600 

GAATCGAATG AGATTTGCCC AGAGGAACCT CCGCAGAGAC AAAGATCGTC GGAACATGTT 660

GCAGTTCAAC CTGCAGATCC TGCCTAAGAG TGCCAAACAG AAAGAGAGAG AACGCATTCG 720 

ACTGCAGAAA AAGTTCCAGA AACAATTTGG GGTTAGGCAG AAATGGGATC AGAAATCACA 780 


GAAACCCCGA GACTCTTCAG TTGAAGTTCG TAGTGATTGG GAAGTGAAAG AGGAAATGGA 840 

TTTTCCTCAG TTGATGAAGA TGCGCTACTT GGAAGTATCA GAGCCACAGG ACATTGAGTG 900 

TTGTGGGGCC CTAGAATACT ACGACAAAGC eTTTGACCGC ATCACCACGA GGAGTGAGAA 960

ckcACTCCGG ~AS^^ GCATCTTCCA CACTGTCACC ACCACAGACG ACCCTCTCAT 1020

CCGCAAGCTG GCAAAAACTC AGGGGAATGT GTTTGCCACT GATGCCATCC TGGCCACGCT 1080 


GATGAGCTGT ACCCGCTCAG TGTATTCCTG GGATATTGTC GTCCAGAGAG TTGGGTCCAA 1140 

ACTCTTCTTT GACAAGAGAG ACAACTCTGA CTTTGACCTC CTGACAGTGA GTGAGACTGC 1200 

CAATGAGCCC CCTCAAGATG AAGGTAATTC CTTCAATTCA CCCCGCAACC TGGCCATGGA 1260

GGCAACCTAC ATCAACCACA ATTTCTCCCA GCAGTGCTTG AGAATGGGGA AGGAAAGATA 1320 

CAACTTCCCC A/^CCAAACC CGTTTGTGGA GGACGACATG GATAAGAATG AAATCGCCTC 1380 


TGTTGCGTAC CGTTACCGCA GTGGNAAGCT TGGAGATGAT ATTGACCTTA TTGTCCGTTG 1440 

TGAGCACGAT GGCGTCATGA CTGGAGCCAA CGGGGAAGTG TCCTTCATCA ACATCAAGAC 1500 

ACTCAATGAG TGGGATTCCA GGCACTGTAA TGGCGTTGAC TGGCGTCAGA AGCTGGACTC 1560

TCAGCGAGGG GCTGTCATTG CCACGGAGCT GAAGAACAAC AGCTACAAGT TGGCCCGGTG 1620 

GACCTGCTGT GCTTTGCTGG CTGGATCTGA GTACCTCAAG CTTGGTTATG TGTCTCGGTA 1680 

CCACGTGAAA GACTCCTCAC GCCACGTCAT CCTAGGCACC CAGCAGTTCA AGCCTAATGA 1740

GTTTGCCAGC CAGATCAACC TGAGCGTGGA GAATGCCTGG GGCATTTTAC GCTGCGTCAT 1800

TGACATCTGC ATGAAGCTGG AGGAGGGCAA ATACCTCATC CTCAAGGACC CCAACAAGCA 1860

GGTCATCCGT GTCTACAGCC TCCCTGATGG CACCTTCAGC TCTGATGAAG ATGAGGAGGA 1920

AGAGGAGGAG GAAGAAGAGG AAGAAGAAGA GGAAGAAACT TAAACCAGTG ATGTGGAGCT 1980

GGAGTTTGTC CTTCCACCGA GACTACGAGG GCCTTTGATG CTTAGTGGAA TGTGTGTCTA 2040

ACTTGCTCTC TGACATTTAG CAGATGAAAT AAAATATATA TCTGTTTAGT CTTAAAAAAA 2100

AAAAAAAAAA AAAAAAAAAN 2120



(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GGAAGCTGAA GTCCTTCCAG ACCAGGGACA ACCAGGGCAT TCTCTATGAA GCTGCACCCA 60

CCTCCACCCT CACCTGTRAC TCAGGACCAC AGAAGCAAAA GTTCTCACTC AAACTGGATG 120 

CCAAGGATGG GCGCTTGTTC AATGAGCAGA ACTTCTTCCA GCGGGCCGCC AAGCCTCTGC 180 


AAGTCAACAA GTGGAAGAAG CTGTACTCGA CCCCACTGCT GGCCATCCCT ACCTGCATGG 240 



GTTTCGGTGT TCACCAGGAC AAATACAGGT TCTTGGTGTT ACCCAGCCTG GGGAGGAGCC 300 

TTCAGTCGGC CCTGGATGTC AGCCCAAAGC ATGTGCTGTG CAGAGAGGTC TGTGCTGCAG 360

GTGGCCTGCC GGCTGCTGGA TGCCCTQGAG TTCCTCCATG AGAATGAGTA TGTTCATGGA 420 

AATGTGACAG CTGAAAATAT CTTTGTGGAT CCAGAGGACC AGAGTCAGGT GACTTTQGCA 480 


GGCTATGGCT TCGCNTTCCG CTATTGCCCA AGTGGCAAAC ACGTGGCCTA CGTGGAAGGC 540 

AGCAGGAGCC CTCACGAGGG GGACCTTGAG TTCATTAGCA TGGACCTGCA CAAGGGATGC 600 

GGGCCCTCCC GCCGCRGCGA CCTCCAGAGC CTGGGCTACT GCATGCTGAA GTGGCTCTAC 660

GGGTTTCTGC CATGGACAAA TTGCCTTCCC AAMAMTGAGG ACATCATGAA GCAAAAACAG 720 

AAGTTTGTTG ATAAGCCGGG GCCCTTCGTG GGACCCTGCG GTCACTGGAT CAGGCCCTCA 780 


GAGACCCTGC AGAAGTACCT GAAGGTGGTG ATGGCCCTCA CGTATGAGGA GAAGCCGCCC 840 

TACGCCATGC TGAGGAACAA CCTAGAAGCT TTGCTGCAGG ATCTGCGTGT GTCTCCATAT 900 

(2) INFORMATION FOR SEQ ID NO: 162:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 
GGCACGAGAT GAGGGGCACC CAGTGCTTCT AGGGCAGGCT GGGTGGTGGT CCCCTAGGTA 60 
TCAGCCTCTC TTACTGTACT CTCCGGGAAT GTTAACCTTT CTATTTTCAG CCTGTGCCAC 120
CTGTCTAQGC AAGCTGGCTT CCCCATTQGC CCCTGTGGGT CCACAGCAGC GTGGCTGCCC 180 
CCCAGGGCCA CCGCTTCTTT CTTGATCCTC TTTCCTTAAC AGTGACTTGG GCTTGAGTCT 240 


GGCAAGGAAC CTTGCTTTTA GCTTCACCAC CAAGGAGAGA GGTTGACATG ACCTCCCCGC 300 
CCCCTCACCA AGGCTGGGAA CAGAGGGGAT GTGGTGAGAG CCAGGTTCCT CTGGCCCTCT 360 
CCAGGGTGTT TTCCACTAGT CACTACTGTC TTCTCCTTGT AGCTAATCAA TCAATATTCT 420
TCCCTTGCCT GTGGGCAGTG GAGAGGCTGC TGGGTGTACG CTGCACCTGC CCACTGAGTT 480 
GGGGAAAGAG GATAATCAGT GAGCACTGTT CTGCTCAGAG CTCCTGATCT ACCCCACCCC 540 


CTAGGATCCA GGACTGGGTC AAAGCTGCAT GAAACCAGGC CCTGGCAGCA AACCTGGGAA 600 
TGGCTGGAGG TGGGAGAGAA CCTGAACTTC TCTTTCCCTC TCCCTCCTCC AACATTACTG 660 
GAACTCTATC CTGTTAGGAT CTTCTGAGCT TGTTTCCCTG CTGGGTGQGA CAGAGGACAA 720
AGGAGAAGGG AGGGTCTAGA AGAGGCAGCC CTTCTTTGTC CTCTGGGGTA AATGAGCTTG 780 
ACCTAGAGTA AATGGAGAGA CCAAAAGCCT CTGATTTTTA ATTTCCATAA AATGTTAGAA 840 


GTATATATAT ACATATATAT ATTTCTTTAA ATTTTTGAGT CTTTGATATG TCTAAAAATC 900 
CATTCCCTCT GCCCTGAAGC CTGAGTGAGA CACATGAAGA AAACTGTGTT TCATTTAAAG 960 
ATGTTAATTA AATGATTGAA ACTTGAAAAA AAAAAAAAAA AAA 1003



(2) INFORMATION FOR SEQ ID NO: 163:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

AAGAAGCGGC ACACGGATGT GCAGTTCTAC ACAGAAGTGG GAGAGATAAC CACGGACTTG 60

GGGAAACATC AGCATATGCA TGACCGAGAT GACCTCTATG CTGAGCAGAT GGAACGAGAA 120

ATGAGGCACA AACTGAAAAC AGCCTTTAAA AATTTCATTG AGAAAGTAGA QGCTCTAACT 180 

AAGGAGGAAC TGGAATTTGA AGTGCXnTTT AGGGACTTGG GATTTAACGG AGCTCCXTTAT 240 

AGGAGTACCT GCCTCCTTCA GCCCACTAGT AGTGCGCTGG TAAATGCTAC GGAATGGCCA 300 

CXrrTTTGTCG TGACATTGGA TGAGGTAGAG CTGATCCACT TTRAGCGGGT CXAGTTTCAC 360 

CTGAAGAACT TTGATATGGT AATCGTCTAC AAGGACTACA GCAAGAAAGT GACCATGATC 420 

AACGCCATTC CTGTAGCCTC TCTTGACCCC ATCAAGGAAT GGTTGAATTC CTGCGACCTG 480 

AAATACACAG AAGGAGTACA GTCCCTCAAC TGGACTAAAA TCATGAAGAC CATTGTTGAT 540 

GACCCTGAGG GCTTCTTCGA ACAAGGTGGC TGGTCTTTCC TGGAGCCTGA GGGTGAGGGG 600 

AGTGATGCTG AAGAAGGGGA TTCAGAGTCT GAAATTGAAG ATGAGACTTT TAATCCTTCA 660 

GAAGATGACT ATGAAGAGGA AGAGGAGGAC AGTGATGAAG ATTATTCATC AGAAGCAGAA 720 

GAGTCAGACT ATTCTAAGGA GTCATTGGGT AGTGAAGAAG AGAGTGGAAA GGATTGGGAT 780 

GAACTGGAGG AAGAAGCCCG AAAAGCGGAC CGAGAAAGTC GTTACGAGGA AGAAGAAGAA 840 

CAAAGTCGAA GTATGAGCCG GAAGAGGAAG GCATCTGTGC ACAGTTCGGG CCGTQGCTCT 900 

AACCGTGGTT CCAGACACAG CTCTGCACCC CCCAAGAAAA AGAGGAAGTA ACTTCTGAAC 960 

TTTGGCCCTG AGCTCCATTC TTCCTCCAGC CAACCCCTGA AAATTTTACA TGACATAGAA 1020 

ACTGTATTTT TCCTTTCGTT TTCATTTGAA GTTTTGCCAT TTGTGTTTAT GQGTTTAGGG 1080 

GGCCATTTGT GTGGACCAAT CTACTCGGGG AATTCCAGGC CCACCAGGAC ACX3TGCCAAT 1140 

GGCCCCATTC AGATGGCAAG GGAGGAGGTG TTCTTGAAGA CAGGAGGAGG CTCCCGCTGT 1200 

TAATAAATAT TGTTTCATTC TTCTCTCTTC CTGTCACCTT CTGCCAAGAC ATTGATGGCT 1260 

TCTGACATCT TATTTGGTGT CTCAAAGCTG TATTTCCAAG ACAGTGGTAC AAGGTGACCC 1320 

TTAATTACCC GTATCATGGT TCTTGACCAG CACATTCAAT CCTCCAACCT ACCCTACTGC 1380 

CATGACCTTC CGCACATCTC TAAGTTTTAT CTTTGCAATA CTCAAGGTTC TCGGAAATTT 1440 

GCTAATGGTT GTGATAAACC ATACAGCTTG AGCCAGTGAG GCAGATTGGG CTGGTGCCTT 1500 

CGTCTGAGTT TTCCTGCTTT CCTGCXTTCGT GCAGATTCTG AGGTATATCT GCTGCCTTGG 1560 

AAGACATAAG AAGCAGTGAT ACTCCCTGGC TCGGTTATTT TCTCCATACA ATGCACACAT 1620 

GGTACAATGA TAGAAGGCAA AATTGCCACT GTCTTCTTTT TTTTCTCATA TATCTAAOGA 1680 

AGATATATCA GGTTGTGCCT CATGTACXXX: TTCTAGTGAA ATGTAGAGGA AGGCTCAAAG 1740

GAGTCAACAT TTAGATCTGG AAGGGACAAG TCATGCCTTG GGCCTAGAAT ACCCTGATGA 1800 

GAAAAGAGAA GAGGAAGGGA GGCCATATCT ACAACANCAN CCTCTCGGCA CTGCTGCTCC 1860 



TTATTTTAAC TTTGTCTTGC ATTGTCCTGT ATTTATCACA GTTTCTGTTG AACAGCTTTT 1920 

CAAGTATTTG GGGAGTTTAT CTTGCCATCC TCCCCTTCTG GTTCTCTGCA CCCACCTGTC 1980 

CCACTGCAGT TCCTTCCGTG CTCTGTGACT TTAAGAGAAG AAGGGGGGAG GGGTCCCGGA 2040 

TTTTATGTTT GTTTGTTTTT TCTCCTTAGC AGTAGGACTT GATATTTTCA ATTTTGGAAG 2100 

AACTAAAAGA TGAATAAACT GGGTTTTTTT TGTTGTTTGT TTTTGTAAAA AAAAAAAAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 2196 






(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

GCACAGAGTC GGGCGGACGG ACAGGGAGAG GAGGAGAGGG GGTCTGCGCG CGGCCGCTAC 60 

CCAGAAGCCA GCGGACGGCA GCACGGAGTG GGCTGTCCCC GAGCCCAGCC CCGAGCGAGC 120 

CCCCCCCCCG CCCCCGMAGG ACGCGCCTYC CAGCCAGCCC GACIYCTAGG AGGAGGGGAG 180 

GCGGGAAAGC AGCTCAAGCC TCACCCACCG CCCTGCCCCC AGCCCCGCCA CTCCCAGGCT 240 

CCTCGGGACT CGGCGGGTCC TCCTGGGAGT CTCGGAQGGG ACCGGCTGTG CAGACGCCAT 300 

GGAGTTGGTG CTGGTCTTCC TCTGCAGCCT GCTGGCCCCC ATGGTCCTGG CCAGTGCAGC 360 

TGAAAAGGAG AAGGAAATGG ACCCTTTTCA TTATGATTAC CAGACCCTGA GGATTGGGGG 420 

ACTGGTGTTC GCTGTGGTCC TCTTCTCGGT TGGGATCCTC CTTATCCTAA GTCGCAGGTG 480 

CAAGTGCAGT TTCAATCAGA AGCCCCGGGC CCCAGGAGAT GAGGAAGCCC AGGTGGAGAA 540 

CCTCATCACC GCCAATGCAA CAGAGCCCCA GAAAGCAGAG AACTGAAGTG CAGCCATCAG 600 

GTGGAAGCCT CTGGAACCTG AQGCGGCTGC TTGAACCTTT GGATGCAAAT GTCGATGCTT 660 

AAGAAAACCG GCCACTTCAG CAACAGCCCT TTCCCCAGGA GAAGCCAAGA ACTTGTGTGT 720 

CCCCCACCCT ATCCCCTCTA ACACCATTCC TCCACCTGAT GATGCAACTA ACACTTGCCT 780 

CCCCACTGCA GCCTGCGGTC CTGCCCACCT CCCGTGATGT GTGTGTGTGT GTGTGTGTCT 840 

GTGACTGTGT GTGTTTGCTA ACTGTGGTCT TTGTGGCTAC TTGTTTGTGG ATGGTATTGT 900 

GTTTGTTAGT GAACTGTGGA CTCGCTTTCC CAGGCAGGGG CTGAGCCACA TGGCCATCTG 960 

CTCCTCCCTG CCCCCGTGGC CCTCCATCAC CTTCTGCTCC TAQGAGGCTG CTTGTTGCCC 1020 

GAGACCAGCC CCCTCCCCTG ATTTAGGGAT GCGTAGGGTA AGAGCACGGG CAGTGGTCTT 1080

CAGTCGTCTT GGGACCTGGG AAGGTTTGCA GCACTTTGTC ATaATTCTTC ATGGACTCCT 1140

TTCACTCCTT TAACAAAAAC CTTGCTTCCT TATCCCACCT GATCCCAGtC TGAAGGTCTC 1200

TTAGCAACTG GAGATACAAA GCAAGGAGCT GGTGAGCCCA GCGTTGACGT CAGGCAGGCT 1260

ATGCCCTTCC GTGGTTAATT TCTTCCCAGG GGCTTCCACG AGGAGTCCCC ATCTGCCCCG 1320

CCCCTTCACA GAGCGCCCGG GGATTCCAGG CCCAGGGCTT CTACTCTGCC CCTGGGGAAT 1380

GTGTCCCCTG CATATCTTCT CAGCAATAAC TCCATGGGCT CTGGGACCCT ACCCCTTCCA 1440

ACCTTCCCTG CTTCTGAGAC TTCAATCTAC AGCCCAGCTC ATCCAGATQC AGACTACAGT 1500

CCCTGCAATT GGGTCTCTGG CAGGCAATAG TTGAAGGACT CCTGTTCCGT TGGGGCCAGC 1560

ACACCGGGAT GGATGGAGGG AGAGCAGAGG CXTTTTGCTTC TCTGCCTACG TCCCCTTAGA 1620

TGGGCAGCAG AGGCAACTCC CX3CATCCTTT GCTCTGCCTG TCRGTGGTCA GAGCGGTGAG 1680

CGAGGTGGGT TGGAGACTCA GCAGGCTCCG TGCAGCCCTT GGGAACAGTG AGAGGTTGAA 1740

GGTCATAACG AGAGTGGGAA CTCAACCCAG ATCCCGCCCC TCCTGTCCTC TGTGTTCCCG 1800

CGGAAACCAA CCAAACCGTG CGCTGTGACC CATTGCTGTT CIKrrGTATCG TGATCTATCC 1860

TCAACAACAA CAGAAAAAAG GAATAAAATA TCCTTTGTTT CCTAGTGAAA AAAAAAAAAA 1920

AAAAAAAAAA AAAAAAAAAA CTCGA 1945

(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2933 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 


GGGTCGACCC ACGCGTCCGG CAGCCGTCGT TTGAGTOGTT QCTGCCGCTG CCCCCTCCCG 60 

GATCAGGAGC CAGTGTATAC CGCCCGCCCA CCGCCTTGGT GCCGCTAGAG GAAACGAGAA 120 

GGAGGCCGCC TGCGGTTTGT CGCCGCAGCT CGCCCMCYGY CYGGRAGAGC CGAGCCCCGG 180

CCCAGTCGGT CGCYTGCCAC CSCTCGTAGC CGTTACCCGC GGGCCGCCAC AGCCGCCGGC 240 

CGGGAGAGGC GCGCGCCATG GCYTCTGGAG CCGATTCAAA AGGTGATGAC CTATCAACAG 300 

CCATTCTCAA ACAGAAGAAC CGTCCCAATC GGTTAATTGT TGATGAAGCC ATCAATGAGG 360 

ACAACAGTGT GGTGTCCTTG TCCCAGCCCA AGATGGATGA ATTGCAGTTG TTCCGAGGTG 420

ACACAGTGTT GCTGAAAGGA AAGAAGAGAC GAGAAGCTGT TTGCATCGTC CTTTCTGATG 480



SOOCID: <WO 983S44aA2 1A> 

ATACTTGTTC TGATGAGAAG ATTCGGATGA ATAGAGTTGT TCGGAATAAC CTTCGTOTAC 540
GCCTAGGGGA TGTCATCAGC ATCCAGCCAT GCCCTGATGT GAAGTACGGC AAACGTATCX: 600 
ATGTGCTGCC CATTGATGAC ACAGTGGAAG GCATTACTGG TAATCTCTTC GAGGTATACC 660 
TTAAGCCGTA CTTCCTGGAA GCGTATCGAC CXIATCCGGAA AGGAGACATT TTTCTTGTCC 720 
GTGGTGGGAT GCX5TGCTGTG GAGTTCAAAG OXSGTGGAAAC AGATCCTAGC CCTTATTGCA 780 
TTGTTGCTCC AGACACAGTG ATCCACTGCG AAGGGGAGCC TATCAAACGA GAGGATGAGG 840 
AAGAGTCCTT GAATGAAGTA GGGTATGATG ACATTGGTGG CTGCAGGAAG CAGCTAGCTC 900 
AGATAAAGGA GATGGTGGAA CTGCXXTCTGA GACATCCTGC CCTCTTTAAG GCAATTGGTG 960 

TGAAGCCTCC TAGAGGAATC CTGCTTTACG GACCTCCTGG AACAGGAAAG ACCCTGATTG 1020 

CTCGAGCTGT AGCAAATGAG ACTGGAGCCT TCTTCTTCTT GATCAATGGT CCTGAGATCA 1080 

TGAGCAAATT GGCTGGTGAG TCTGAGAGCA ACCTTCGTAA AGCCTTTGAG GAGGCTGAGA 1140 

AGAATGCTCC TGCCATCATC TTCATTGATG AGCTAGATGC CATCGCTCCC AAAAGAGAGA 1200 

AAACTCATGG CGAGGTGGAG CGGCGCATTG TATCACAGTT GTTGACCCTC ATGGATGGCC 1260 

TAAAGCAGAG GGCACATGTG ATTGTTATGG CAGCAACCAA CAGACCCAAC AGCATTGACC 1320 

CAGCTCTACG GCX5ATTTGGT CGCTTTGACA GGGAGGTAGA TATTGGAATT CCTGATGCTA 1380 

CAGGACGCTT AGAGATTCTT CAGATCCATA CCAAGAACAT GAAGCTGGCA GATGATGTGG 1440 

ACCTGGAACA GTAGCCAATG AGACTCACGG GCATGTGGGT GCTGACTTAG CAGCCCTGTG 1500 
CTCAGAGGCT GGTCTGCAAG CCATCCGCAA GAAGATGGAT CTCATTGACC TAGAGGATGA 1560

GACCATTGAT GCCGAGGTCA TGAACTCTCT AGCAGTTACT ATGGATGACT TCCGGTGGGC 1620 

CTTGAGCCAG AGTAACCCAT CAGCACTGCG GGAAACCGTG GTAGAGGTGC CACAGGTAAC 1680 

CTGQGAAGAC ATCGGGGGCC TAGAGGATGT CAAACGTGAG CTACAGGAGC TGGTCCAGTA 1740 

TCCTGTGGAG CACCXIAGACA AATTCCTGAA GTTTGGCATG ACACCTTCCA AGGGAGITCT 1800 

GTTCTATGGA CCTCCTGGCT GTGGGAAAAC TTTGTTGGCC AAAGCCATTG CTAATGAATG 1860 

CCAGGCCAAC TTCATCTCCA TCAAGGGTCC TGAGCTGCTC ACCATGTGGT TTGGGGAGTC 1920 
TGAGGCCAAT GTCAGAGAAA TCTTTGACAA GGCCCGCCAA GCTGCCCCCT GTGTGCTATT 1980 
CTTTGATGAG CTGGATTCGA TTGCCAAGGC TCGTGGAGGT AACATTGGAG ATGGTGGTQG 2040 
GGCTGCTGAC CGAGTCATCA ACCAGATCCT GACAGAAATG GATGGCATGT CCACAAAAAA 2100 
AAATGTGTTC ATCATTGGCG CTACCAACCG GCCTGACATC ATTGATCCTG CCATCCTCAG 2160 
ACCTCGCXGT CTTGATCAGC TCATCTACAT CCCACTTCCT GATGAGAAGT CCCGTGTTGC 2220 
CATCCTCAAG GCTAACCTGC GCAAGTCCCC AGTTGCCAAG GATGTGGACT TGGAGTTCCT 2280 

GGCTAAAATG ACTAATGGCT TCTCTGGAGC TGACCTGACA GAGATTTGCC AGCGTGCTTG 2340

CAAGCTGGCC ATCCGTGAAT CCATCGAGAG TGAGATTAGG CGAGAACGAG AGAC3GCAGAC 2400

AAACCCATCA GCCATGGAGG TAGAAGAGGA TGATCCAGTG CCTGAGATCC GTCGAGATCA 2460

CTTTGAAGAA GCCATGCGCT TTGCGCGCCG TTCTGTCAGT GACAATGACA TTCGGAAGTA 2520

TGAGATGTTT GCXXAGACCC TTCAGCAGAG TCGGGGCTTT GGCAGCTTCA GATTCCCTTC 2580

AGGGAACCAG GGTGGAGCTG GCCCCAGTCA GGGCAGTGGA GGCX3GCACAG GTGGCAGTGT 2640

ATACACAGAA GACAATGATG ATGACCTGTA TGGCTAAGTG GTGGrTGGCCA GCGTGCAGTG 2700

AGCTOGCXTTG CCTGGACCTT GTTCCCTGGG GGTGGGGGCG CTTGCCCAGG AGAGGGACCA 2760

GGGGTGCGCC CACAGCCTGC TCCATTCTCC AGTCTGAACA GTTCAGCTAC AGTCTGACTC 2820

TGGACAGGGG GTTTCTGTTG CAAAAATACA AAACAAAAGC GATAAAATAA AAGCGATTTT 2880

CATTTGGTAA AAAAAAAAAA AAAAAAAAAT CCGGGGGGGG GCCCX3AACCA TTT 2933

(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

TCGGACSAfiCC GGCGGGCGl^ CGCCTCTCGG CCAGGAAiGOS CCTCITGGAC GOGTGTTmcC 60

GATGCCCAGA AGTGGCCTTG GGCTGGGGAT CACCATAGCT TTTCTAGCTA CGCTGATCAC 120

GCAGTTTCTC GTGTATAATG GTGTCTATCA GTATACATCC CCAGATTTCC TCTATATTCG 180

TTCTTGGCTC CCTTGTATAT TTTTCTCAGG AGGCGTCACG GTGGGGAACA TAGGACGACA 240

GTTAGCTATG GGTGTTCCTG AAAAGCCCCA TAGTGATTGA GTCTTCAAAA CCACCGATTC 300

TGAGAGCAAG GAAGATTTTG GAAGAAAATC TGACTGTGGA TTATGACAAA GATTATCTTT 360

TTTCTTAAGT AATCTATTTA GATCGGGCTG ACTGTACAAA TGACTCCTGG AAAAAACTCT 420

TCACCTAGTC TAGAATAGGG AGGTGGAGAA TGATGACTTA CCCTGAAGTC TTCCCTTGAC 480

TGCCCGCACT GGCGCCTGTC TGTGCCCTGG AGCATTCTGC CCAGGCTACG TGGGTTCAGG 540

CAGGTGGCAG CTTCCCAAGT ATTCGATTTC ATTCATGTGA TTAAAACAAG TTGCCATATT 600

TCAAAGCCTT GAACTAAGAC TCAATTACCA ACCCGCAGTT TTGTGTCAGT GCCCAAAGGA 660

GGTAGGTTGA TGGTGCTTAA CAAACATGAA GTATGGTGTA ATAGGAATAA TATTTATCCA 720
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AAAGATTTTT AAAAATAGGG CTGTGTTTAA AAAAAAAAAC AAAACARGAA AAGCAGCAGT 780 

GATTATAGAG AGGTCACACT CTAAGTGGGG TCGCGGCGTG GCCACGCTTC ACGGTCACGC 840 

TCGTCCGTCC TGCAGTGGCG TGTTTACATG GTCACACX?rG TGTGTATCAC CAGTGGGTCA 900 

ACTGCTTGTC ATTCCTCCCG TGGCAGTTTG TGTAGACAAT CTTACTGAGC AAAACSGCAAT 960 

GAAAAGTCTT GGTTCCCACA CTGCX3ATATA TTGGAATTTT CACCTCAGTT TATGAAGTTT 1020 

ATTTCGAAAT CCATAGTCAT CTAAGAATGA ATACCTGTCT GCCATGTATT TCAATCTTAG 1080 

TGAGCCAAAA TrGTTTGTTT GTTACTACAG AATAGAGATG ACTGTTTTTT GCCACAGCCC 1140 

TATGGRATTT GCAATCTGTG ATTGCCTTGT AAAAAGGAGA GTGCATATGG CACTGCATTA 1200 

AACGTGTGGT GTTTCTAGTC AATGATATTG GTGAGCACAA TGTATTCATT TAATGGCATA 1260 

GACCATACCA GACCTAATTT GCAAGTATTG GGTCTTAAAC TTCAAGTGCA ATGTATATGA 1320 

AAACCAATCT GAGCCTTGTA TCTCTTAAAT ATTTATTTTT TTTAACGTGT GAGATGTTCG 1380 

AGAGAAGGTT CTCCATTCAT TTCAGTGCTG CCTQGAGGAA ACTOGGCAAT GATTTCTTTC 1440 

AGTTGTGAAG TTCCTTTCGT GTTACACCCT CCACTGAACC CTCAACCTTC GAAATACTCC 1500 

AGTTTTGTGG GTTTGGTCAT TTTTACTTAT AAATTTACCT TTTTGTATTT TOCAATTTAC 1560 

ATGTGTTTGG TTTGTTTTAA ATTCTGTGAA AGTGGCTTGA TTAAAAGACT CCTTTTAAAT 1620 

GGAAGCCACC AGTCAGCAGA ATGGAAGCTT AGAGGAACTT GCCTGTGAGC GCTGGTCTTT 1680 

GTGTTTGGTT TTGTGATGTA ACX3ATCTTTG CTGGGGTTTT TTGCTTTGTT TTGAGGGAAA 1740 

TGTCITGGAG TAAATTTTAA GTTCCTGGAG TTAATTTGTT TTACAGGAAT TTTGTTrTTT 1800 

AAAAAAATAG GATCATTCTG AACTTTGGAA TGACCCCCTT ATATATTTTC TGAAAATGAA 1860 

AACAGTTACA TGAAAAAAAT TTCCAATGAA GATGTCAGCA TTTTATGAAA AACCAGAAGT 1920 

TATTAGATGA AAGCAGCGAG TGAATCTTTA AAACAGACTT GPlTCACGCAC ACACAATAAG 1980 

TCTTTCTCTC CGAAACCGGA AGTAAATCTA TATCTGTTAG AAATAATGTA GCCAAAAGAA 2040 

TGTAAATTTG AGGATTTTTT TGCCAATAGT TTATAGAAAA TATATGAACC AAAGTGATTT 2100 

GAGTTTGTAA AAATGTAAAA TAGTATGAAC AAAATTTGCA CTCTACCAGA TTTGAACATC 2160 

TAGTGAGGTT CACATTCATA CTAAGTTTTC AACATTGTGT TCTTTTTGCA TTCATTTTTT 2220 

ACTTTTATTA AAGGTTCAAA ACC 2243 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

GGTGGGNAGC TTTNAATTTC CCCTTACWGG GGCGCTOTAA GGGGAAACCT TCCCGGAATT 60 

TTCGGGTCGA CCCACGCGTC CGGCCAGCCT AGGAGAAGAA GrTTCGTAGTC CCAGAGGTGA 120 

GGCAGGAGGC GGCAGTTTCT GGCGGGTGAG GGCGGAGCTG AAGTGACAGC QGAGGCGGAA 180

GCAACGGTCG GTGGGGCGGA GAAGGGGGCT GGCCCCAGGA GGAGGAGGAA ACCCTTCCGA 240 

GAAAACAGCA ACAAGCTGAG CTGCTGTGAC AGAGGGGAAC AAGATGGCGG CGCCGAAGGG 300 

GAGCCTCTGG GTGAGGACCC AACTGGGGCT CCCGCCGCTG CTGCTGCTGA CCATGGCCTT 360 

GGCCGGAGGT TCGGGGACCG CTTCGGCTGA AGCATTTGAC TCGGTCTTGG GTGATACGGC 420 

GTCTTGCCAC CGGGCCTGTC AGTTGACCTA CCCCTTGCAC ACCTACCCTA AGGAAGAAGA 480

GTTGTACGCA TGTCAGAGAG GTTGCAOGCT GTTTTCAATT TGTCAGTTTG TGGATGATGG 540 

7ATTGACTTA AATCGAACTA AATTGGAATG TGAATCTGCA TGTACAGAAG CATATTCCCA 600 

ATCTGATGAG CAATATGCTT GCCATCTTGG KTOCCAGAAT CAGCTGCCAT TCGCTGAACT 660 

GAGACAAGAA CAACTTATGT CCCTGATGCC AAAAATGCAC CTACTCTTTC CTCTAACTCT 720 

GGTGAGGTCA TTCTGGAGTG ACATGATGGA CTCCGCACAG AGCTTCATAA CCTCTTCATG 780

GACTTTTTAT CTTCAAGCCG ATGACGGAAA AATAGTTATA TTCCRGTCTA AGCCCAGRAA 840 

TCCCAGGTAC GCACCACATT TQGAGCCAGG AGCCCTACCA AATTTGRGRG RAWCMTCTCT 900 

AAGCAAAATG TCCNTCAKMT CGSMAATGAG AAATTCACAA GCGCACAGGA ATTTTCTTGA 960 



AGATGGAGAA AGTGATGGCT TTTTAAGATG CCTCTCTCTT AACTCTGGGT GGATTTTAAC 1020 

TACAACTCTT GTCCTCTCGG TGATGGTATT GCTTTGGATT TGTTGTGCAA CTTGTTGCTA 1080

CACGCTGTTG GACGCAGTAT AGTTTCCCTC TGAGAAGCTG AGTATCTATG GTGACTTGGA 1140 

GTTTATGAAT GAACAAAAGC TAAACAGATA TCCAGCTTCT TCTCTTGTGG TTGTTAGATC 1200 

TAAAACTGAA GATCATGAAG AAGCAGGGCC TCTACCTACA AAAGTGAATC TTGCTCATTC 1260 

TGAAATTTAA GCATTTTTCT TTTAAAAGAC AAGTGTAATA GACATCTAAA ATTCCACTCC 1320 

TCATAGAGCT TTTAAAATGG TTTCATTGGA TATAGGCCTT AAGAAATCAC TATAAAATGC 1380

AAATAAAGTT ACTCAAATCT GTGAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG 1440 

GCCCCyiTACC AAKTCGCCCT ATOGTGADrTB GTATTMTTAT TTTACTAATA TCTGTAGCTA 1500 

TTrTGTTTTT KGCTTKGGTT ATKGTTTTTY TCCCTTYTCT WAGCTATRAG CTGATCATKG 1560 

CYSCTTCTCA CCTCCTGCCA TGATACTGTC AGTTACCTTA CTTTAACAAGC TGAATATTTA 1620 

GTAGAAATGA TGCTTCTGCT CAGGAATGGC CCACAAATCT GTAATTTGAA ATTTAGCAGG 1680

AAATGACCTT TAATGACACT ACATTTTCAG GAACTGAAAT CATTAAAATT TTArTTGAAT 1740

AATTATGTGC TGAAAAAAAA AAAAAAAAAA AMWMRARASK RRWWACTCGA GGGGGGGCCX: 1800 

GGTACCCNAT TCGCCG 1816 



(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

AGAAACCGTT GATGGGACTG AGAAACCAGA GTTAAAACCT CTTTGGAGCT TCTGAGGACT 60 

CAGCTGGAAC CAACGGGCAC AGTTGGCAAC ACCATCAACT TCTCCCAAGC AGAGAAACCC 120 

GAACCCACCA ACCAGGGGCA GGATAGCCTG AAGAAACATC TACACGCAGA AATCAAAGTT 180

ATTGGGACTA TCCAGATCTT GTGTGGCATG ATGGTATTGA GCTTGGGGAT CATTTTGGCA 240 

TCTGCTTCCT TCTCTCCAAA TTTTACCCAA GTGAdTCTA CACTGTTGAA CTCTGCTTAC 300 

CCATTCATAG GACCCTTTTT TTTTATCATC TCTGGCTCTC TATCAATCGC CACAGAGAAA 360 

AGGTTRACCA AGCTTTTGGT GCATAGCAGC CTGGTTGGAA GCATTCTGAG TGCTCTGTCT 420 

GCCCTGGTGG GTTTCATTAT CCTGTCTGTC AAACAGGCCA CCTTAAATCC TGCCTCACTG 480



CAGTGTGAGT TGGACAAAAA TAATATACCA ACAAGAAGTT ATGTTTCTTA CTTTTATCAT 540 

GATTCACTTT ATACCACGGA CTGCTATACA GCCAAAGCCA GTCTGGCTGG AWCTCTCTCT 600 

CTGATGCTGA TTTGCACTCT GCTQGAATTC TGCCTAGCTG TGCTCACTGC TGTGCTGCGG 660 

TGGAAACAGG CTTACTCTGA CTTCCCTGGG AGTGTACTTT TCCTGCCTCA CAGTTACATT 720 

GGTAATTCTG GCATGTCCTC AAAAATGACT CATGACTGTG GATATGAAGA ACTATTGACT 780

TCTTAAGAAA AAAGGGAGAA ATATTAATCA GAAAGTTGAT TCTTATGATA ATATGGAAAA 840 

GTTAACCATT ATAGAAAAGC AAAGCTTGAG TTTCCTAAAT CSTAAGCITTT AAAGTAATGA 900 

ACATTAAAAA AAACCATTAT TTCACTGTCA TTTAAAGATA ATGTG 945 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 
(B) TYPE: nucleic acid



WO 98/39448 PCT/US98/04493

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

GGCAGAGCCA CAGGAAGGAT GAGGAAGACC AGGCTCTGGG GGCTGCTGTG GATGCTCTTT 60 

GTCTCAGAAC TCCGAGCTGC AACTAAATTA ACTGAGGAAA AGTATGAACT GAAAGAGGGG 120 

CAGACCCTGG ATGTGAAATG TGACTACACG CTAGAGAAGT TTGCCAGCAG CCAGAAAGCT 180

TGGCAGATAA T7AGGGACGG AGAGATGCCC AAGACCCTGG CATGCACAGA GAGGCCTTCA 240 

AAGAATTCCC ATCCAGTCCA AGTGGGGAGG ATCATACTAG AAGACTACCA TGATCATGGT 300 

TTACTGCGCG TCCGAATGGT CAACCTTCAA GTGGAAGATT CTGGACTGTA TCAGTGTGTG 360 

ATCTACCAGC CTCCCAAGGA GCCTCACATG CTGTTCGATC GCATCCGCTT GGTGGTGACC 420 

AAGGGTTTTT CAGGGACCCC TGGCTCCAAT GAGAATTCTA CCCAGAATGT GTATAAGATT 480

CCTCCTACCA CCACTAAGGC CTTGTGCCCA CTCTATACCA GCCCCAGAAC TGTGACCCAA 540 

GCTCCACCCA AGTCAACTGC CGATGTCTCC ACTCCTGACT CTGAAATCAA CCTTACAAAT 600 

GTGACAGATA TCATCAGGGT TCCGGTGTTC AACATTGTCA TTCTCCTGGC TGGTGGATTC 660 

CTGAGTAAGA GCCTGGTCTT CTCTGTCCTG TTTGCTGTCA CGCTGAGGTC ATTTGTACCC 720 

TAGGCCCACG AACCCACGAG AATGTCCTCT GACTTCCAGC CACATCCATC TGGCAGTTGT 780

GCCAAGGGAG GAGGGAGGAG GTAAAAGGCA GGGAGTTAAT AACATGAATT AAATCTGTAA 840 

TCACCRGCTA AAAAAAAAAA AAAAAAAACN CGANCCTNGG TTTTCAGCTC CATCAGCTCC 900 

TT 902 



(2) INFORMATION FOR SEQ ID NO: 170: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1883 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

AGAAAACAAC TGAAAAACCA CATTTTTCTA CATACAGCTG GGGAGGTAGC TGAGAACTTG 60 

GCACTGCGCA CACATACTAG GTTGAAAGAG AGTTGAGGAA ACCAGAAGGC CAAGTGGATC 120 

TGCTGGCAAA CCCTGAACCT GTCTCCTGCG CTTGCTCTAC AGTTCTGAAG TTGAAAATCC 180

TTTTCATGCC TAGCATCTGC TTGAGTTATA AACCCCAAGG CAGCCATGTC ATAGACTAGT 240 

GTTTACTCTT GTTTTGACTT TGTTTTAATG CTTCCTAAGA CCCAAGTGCC TCCTGCTGTT 300 

TCCTCCTTTG TGGTAGCCTC TGGCCATCTG GGACCTCAAT CCCCAGCTTT CCCACTTTCA 360

GCAGTCCTTT GCTCTCTTTG CTTCTACCTC AAATAGCCCC AGGAGTGGGC TTTAGTCTCC 420 

AATATGGAGC ATYTCAAGCT TCTCCTGGGG GATGGGGATT GGGATGGGCA GAATCTGTTT 480 

TGGWTCTCCG GGTTATTTCC AGTGGGTGTA AAAGCAGAGC TGGGCCTTTC CCTCTCTTAT 540 

CCCTGAGGGT GGGTAAGAAG GACTGTATCT ACACCTGTTC TTCCCTACCT TCTCTTTTGT 600 

TAGGGAGGCC TCATTCTAAG TTCCTCAAGA GAGTCCTTGG CTTAAAGCTG TAGCAAGGGT 660 

GTGCTAGGTG GGGGATTTGG AGCAAAACCG TCGAGTAGGC ATGATACTGG TATGGAGTGG 720 

GCCTGCAAAA TCAGACAGAA ATGGCTTGAG AAGCCGCAGG QGAGCATGCC TGTCTCTCAG 780 

TGATAGAGTA TGGGAGGGAC CTCCCTAGCT TGGAAAATGA GAATTGAAGG QGTTATGAAC 840 

AAATAGGATG CCTAGTTGAG GATGTTCCCA AAGTTTTGTC CAATCTTATC ATTAGTAGAT 900 

TTTATAAGCC ACAGAGACAA ACCAGAAACG GAATAATGTT ACTTTGGATG CTTTATTTTT 960 

TTCTTCTAGG TGTGGCTTTG TACATGCAGA AGAATGCTAT ATGCTGCACA TTTTGCCTTT 1020 

AAAGTCTTAC GACTTTCCCC ATTTTAGTCT AATGGGAAGA TACAGATGTG CAAGTCTGCT 1080 

TTTTTGTTTT TTGTTATTAT TTTTTTTTTT TTGCTCTGTG TTATGGACAT TTTCAGACAT 1140 

GCACAGAAGT GGAGAGGATG GTCCTTGGAC CCCATGTGTC CATCACCTAG CTGCATCACT 1200 

TATCAGCTAT GGTCAACCTG GTTTCATCTG TATCTCTCTC TTTTCACCTG TATTGTTTAT 1260 

TGAAAATCCA AGACACTATG CCAATGCAAC CX3TGACTACT TTGGGAGAIT GGTAGTCTCT 1320 

TTTGATGGTG ATAGTGATGG GGTGCACTAT CATAATCACA TCAGGTCTGC TTTTTGCTTT 1380 

TAATGTTAAC TAATGAAGTT CCAGAGATGG GCCTTAQAAA TGTGTTTTAA GAATTAACAA 1440 

GGAGTCTCAA AAAGAAATGA GAGGGATGCT TCCTTTCCCC TTGCATCTAC AAAACAAGAG 1500 

AGAGACTGTT CTGTTGTAAA ACTCTTTCAA AAATTCTGAT ATGGTAAGGT ACTTGAGACC 1560 

CTTCACC^GA ATGTCAATCT TTTTTTCTGT GTAACATGGA AACTTGTGTG ACCATTAGCA 1620 

TTGTTATCAG CTTGTACTGG TCTCATAACT CTGGTTTTGG AAGAATAATT TGGAAATTGT 1680 

TGCTGTGTTC TGTGAAAATA ACCTCCCCAA AATAATTAGT AACTGGTTGT TCTACTTGGT 1740 

AATTTGACAC CCTGTTAATA ACGCAATTAT TTCTGTGTTC TTAAACAGTA TAAATAGTTG 1800 

TAAGTTTGCA TGCATGATGG AAAAATAAAA ACCTGTATCT CTGTTAAAAA AAAAAAAAAA 1860 

AAAAAAAAAA AAAAAAAAAA AAA 1883 



(2) INFORMATION FOR SEQ ID NO: 171: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2100 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



TACTTTTAGA TTTACTGCCT TCAAAAAGTG CCTATTCTGA GCAACATAAA CGTTATTCCT 60 

TACATATGTA TGTACACACG GTACCCAGAG TCGTACTGTG GCAGCCTTCA AAAACATACC 120

ATCAGAAAGA GTAGGTGCTG AGATAAGGNA ACTTTGCCAA ATGNAAGAAA GTCACTCACT 180 

TCCAATATCC CCTCTTCAAG CGGCTACCGT GRAASGQGCT GCAAACACAT TCCCTGAGCA 240 

TCCCTTGCTG ATACAGCTTC TTTATArTTA TATCCTACTG GATGGTAGCA TATTGCTAAG 300 

GTTTCCTGTA CTCTGCTTCA AGGGAATGTA AGYTTTATGG CATTGAAACA TTTAGGAAAA 360 

AAAAAGATGT TTAAGAGAAT TAATAGAGCC GTAGTCTGTA TTAGGATGTG TGTCATATGT 420

GTGTTCTATA T^CTAAGCAT CGGTGGGTTT AGAGTGTTAA AGTGTCAGCA CATTCCTTCT 480 

CCTTTTGTCT CTCAGGCTAA CATGAGAGAA AATAGAAAAG TCTTGGCTGT GGGGATTGGA 540 

AGCTCAGGGG GCCAAATGTC dTGCCAGAT CCTTAGAGCA TTACTTTGAG TCCTAAAAAT 600 

AGTAGTGTAT GTTATTTGAT GGCTTTTGTT TCCATAGTTC CATCACTGAC AAAACTGTCA 660 

ATACTGTTGA TGGAGCAGCA GCATAGCCTA GAGTGATGCA TTCTTACCCA GAGGTGGCAA 720

TAGGAGAGGG TCCATGTAAA TAGGACGAGG TAGACAGTGC ATGATTGTAG GAGAAGGGTT 780 

GAAGGGAGGA CATGATTCCA AAAAAGATCG TTCTCAATGT GTCGTCTGAC TCAACCAGCT 840 

GGCAGATTAC ACTTGCCAAG TCGTTCCCTT TCCTTCTAAG TCAGTTGGCT CCATATTCAC 900 

TTGAATATGC CTCTGTTTGG GCAAAGCAAG ATACCTCCAC TTAACCTTTA TCCAAGGAAG 960 

CTCTTGGTGT CCTCTTGGTC ATAAAGTTGT CTCCTACCTA ACCCAGTTTT ACCAAATGGA 1020

AGTAAAAGGG GACAAACTAT GGAAGATGGA CTCCATGCCA TTGCAGTCAG CCACCAITCT 1080 

CTTTTCCATA TAAGGAGCCC CATTACATAA GCTACGGGTG AGGTTGGAAC AGCTATGTTT 1140 

CATAATTTCA AGAGTGTGAC CACCCTGCTC TAGTCATCAT CATTGGATGA ATCCAGTTGA 1200 

CTCTTTQGCA AAAGGGTGAT ACTTTTCACT AAAAATGCCT ACTCTTCCTG TTGATGTTCC 1260 

TTTTCTGTTT TTACCTTGTC CAATTTCCAC ACTAGTCATT TTTTTTATTT TTTAGAGGAT 1320

CAGATTTTAG CGCTGGAAAA TGAGTTCAAA AATTTCAGTG TAATGTCATA AGGATGTTGG 1380 

GATACAGAGA T T ITl Trr iT CCTTGGAAAC T^TGGACTG GGAAGAAACA CAGCATQGCT 1440 

TTGCTCTGAG TOTCAATCTG ATGATTATGA CCATGGAAGA TAGTCTTATG TAAAGGTTAA 1500 

ATGGTGTTTA CAAGTGGATA GATAAGGCGG AGATGGTGAG AAGCCGQGTT TTCTCTATGC 1560 

TAAATGTGTC TACTAAGAGC AGCACTTCCT ACTAGCTAAG CACAATCATA GCCCCACCGT 1620

GATGAGCTGC TAGTCTGAAT AACATTCCCT GACTTAGGGA AAGGCACACA AAAACATATA 1680

AAGAATATGT CTATTTTCAT ATGTGTGATA CTGACAGAGC CATGGTATTC CTAAAATATA 1740 

GGTTTCTCTT TTTTCTTGTA TTCTTAGCAA ATTGCATTTA TTCACTACAT TACAAACCAT 1800 



CACTGATGTA TCCAAAATAG CACACATAGT TCACTTATGAA AATAAGAGAA TAAAATCTGT 1860 
TATAAGCAAG TGATTTAGGT ATTTTCTTTT GTGTTTATGC ATTATCTGAC TATATTAAAA 1920



CCTGITTTTC TATTTACCTT CTATCAGTTT TCTCTACCAA TTATGTTTTT TCAATGCTCT 1980 

ATAAGAATGA ATATGGAAAT TATATTTCTT TTTTCTGTAA AAGAGTTC3CA ACTACTTTAT 2040 

TATATTTAGA AATCCAATAA ACTTCTTATT ACATTTAAAA AAAAAAAAAA AAAACTCGAA 2100 



(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1930 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

CCTTTGANTG TGGTCCCGGG TGCNGATTGG CAGCGCCTCC GCCC5CGGCTC GTGGTTGTCC 60 
CGCCATGGCA CTGTCGCGGG GGCTGCCCCG GGAGCTGGCT GAGGCGGTGG CCGGGGGCCG 120 
GGTGCTGGTG GTGGGGGCGG GCGGCATCGG CTGCGAGCTC CTCAAGAATC TCGTGCTCAC 180



CGGTTTCTCC CACATCGACC TGATTGATCT GGATACTATT GATGTAAGCA ACCTCAACAG 240 



ACAGTTTTTG TTTCAAAAGA AACATGTTGG AAGATCAAAG GCACAGGTTG CCAAGGAAAG 300 

TCTACTGCAG TTTTACCCGA AAGCTAATAT CGTTGCCTAC CATGACAGCA TCATGAACCC 360 



TGACTATAAT GTGGAATTTT TCCGACAGTT TATACTQGTT ATGAATGCTT TAGATAACAG 420 
AGCTGCCCGA AACCATGTTA ATAGAATGTG CCTC3GCAGCT GATGTTCCTC TTATTGAAAG 480



TGGAACAGCT GGGTATCTTG GACAAGTAAC TACTATCAAA AAGGGTGTGA COGAGTGTTA 540 



TGAGTGTCAT CCTAAGCCGA CCCAGAGAAC CTTTCCTGGC TGTACAATTC CTAACACACC 600 

TTCAGAACCT ATACATTGCA TCGTTTGGGC AAAG!TACTTG TTCAACCAGT TGTTTGGGGA 660 

AGAAGATGCT GATCAAGAAG TATCTCCTGA CAGAGCTGAC CCTGAAGCTG CCTGGGAACC 720 

AACGGAAGCC GAAGCCAGAG CTAGAGCATC TAATGAAGAT GGTGACATTA AACGTATTTC 780

TACTAAGGAA TGGGCTAAAT CAACTGGATA TGATCCAGTT AAACTTTTTA CCAAGCmT 840 



TAAAGATGAC ATCAGGTATC TGTTGACAAT GGACAAACTA TGGCGGAAAA GGAAACCTCC 900 

AGTTCCGTTG GACTGGGCTG AAGTACAAAG TCAAGGAGAA GAAACGAATG CATCAGATCA 960

ACAGAATGAA CCCCAGTTAG GCCTGAAAGA CCAGCAGGTT CTAGATGTAA AGAGCTATGC 1020 

ACX3TCTTTTT TCAAAGAGCA TCGAGACTTT GAGAGTTCAT TTAGCAGAAA AGGGGGATGG 1080

AGCTGAGCTC ATATGGGATA AGGATGACCC ATCTGCAATG GATTTTGTCA CCTCTGCTGC 1140 

AAACCTCAGG ATCCATATTT TCAGTATGAA TATGAAGAGT AGAITTGATA TCAAATCAAT 1200 

GGCAGGGAAC ATTATTCCTG CTATTGCTAC TACTAATGCA GTAATTGCTG GGTTGATACT 1260 

ATTGGAAC3GA TTGAAGATTT TATCAGGAAA AATAGACCAG TGCAGAACAA TTTTTTTGAA 1320 

TAAACAACCA AACCCAAGAA AGAAGCTTCT TGTGCCTTGT GCACTGGATC CTCCCAACCC 1380

CAATTGTTAT GTATGTGCCA GCAAGCCAGA GGTGACTGTG CGGCTGAATG TCCATAAAGT 1440 

GACTCTTCTC ACCTTACAAG ACAAGATAGT GAAAGAAAAA TTTGCTATGC TAGCACCAGA 1500 

TGTCCAAATT GAAGATGGGA AAGGAACAAT CCTAATATCT TCCGAAGAGG GAGAGACGGA 1560 

AGCTAATAAT CACAAGAAGT TGTCAGAATT TGGAATTAGA AATGGCAGCC GGCTTCAAGC 1620 

AGATGACTTC CTCCAGGACT ATACTTTATT GATCAACATC CTTCATAGTG AAGACCTAGG 1680

AAAC5GACGTT GAATTTGAAG TTGTTGGTGA TGCCCCGGAA AAAGTGGGGS CCAAACAAGC 1740 

TGAAGATGCT GCCAAAAGCA TAACCAATGG GCAGTGATGA TGGGAGCTTC AGCCCTCCAC 1800 

CTYCACAGCT TCAAGGAGGC AAGATGGACG TYTCYCATAG TTGATYCX3GR TGAAGAAGRT 1860 

TCTCCAATAA TTGCCCGACG TTCATTGAAG GAAGGAGGAG GAGGCCCGCC AAGAGGGGAA 1920 

TTTAGGNTTG 1930

(2) INFORMATION FOR SEQ ID NO: 173:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



GGCCCTGGCC TCTGGGCTGA GGCTTGCTAG GGACTCGGGG TGGCTCTAAG GGGCAGGGAT 60

AGGGCTGGGG AGCGCCGGCC TGTGGCCCTG ACCAGCCCCT TCTCGTGCRG GTTCCACCCC 120 

GATGCAQGTG GTCACGTGCT TGACGCGGGA CAGCTACCTG ACGCACTGCT TCCTCCAGCA 180 

CCrCATGGTC GTGCTGTCCT CTCTGGAACG CACGCCCTCG CCGGAGCCTG TTGACAAGGA 240 

CTTCTACTCC GAGTTTGGGA ACAAGACCAC AGGGAAGATG GAGAACTACG AGCTGATCCA 300 

CTCTAGTCX5C GTCAAGTTTA CCTACCCCAG TGAGGAGGAG ATTGGGGACC TGACGTTCAC 360

TGTGGCCCAA AAGATGGCTG AGCCAGAGAA GGCCCCAGCC CTCAGCATCC TC3CTGTACGT 420

GCAGGCCTTC CAGGTXX3GCA TGCCACCCCC TGGGTGCTGC AGGGGCCCCC TGCGCCCCAA 480 

GACACTCCTG CTCACCAGCT CCGAGATCTT CCTCCTGGAT GAGGACTGTG TCCACTACCC 540 

ACTGCCCGAG TTTGCCAAAG AGCCGCCX3CA GAGAGACAGG TACCX3GCTGG ACGATGQCCG 600 

CCGCGTCCGG GACCTGGACC GAGTGCTCAT GGGCTACCAG ACCTACCCGC AGCCCTCACC 660 

CTCGTCTTCG ATGACGTGCA AGGTCATGAC CTCATGGGCA GTGTCACCCT GGACCACTTT 720 

GGGGAGGTGC CAGGTGGCCC GGCTAGAGCC AGCCAGGGCC GTGAAGTCCA GTGGCAGGTG 780 

TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG CTCATCTCGC TGTTGGCTCG CCAGTGGGAG 840 

GCCCTGTGTG GCCGTGAGCT GCCTGTCGAG CTCACCGGCT AGCCCAGGCC ACAGCCAGCC 900 

TGTCGTGTCC AGCCTGACGC CTACTGGGGC AGGGCAGCAG GCTTTTGTGT TCTCTAAAAA 960 

TGTTTTATCC TCCCTTTGGT ACXOTAATTT GACTGTCCTC QCAGAGAATG TGAACATGTG 1020 

TGTGTGTTGT GTTAATTCTT TCTCATGTTG GGAGTGAGAA TGCCXX3GCCC CTCAGGGCTG 1080 

TCGGTGTGCT GTCAGCCTCC CACAGGTGGT ACAGCCGTGC ACACCAGTGT CGTGTCTGCT 1140 

GTTGTGGGAC CGTTGTTAAC ACGTGACACT GTGGGTCTGA CTTTCTCTTC TACACGTCCT 1200 

TTCCTGAAGT GTCGAGTCCA GTCCTTTGTT GCTGTTGCTG TTGCTGTTGC TGTTGCTGTT 1260 

GGCATCTTGC TGCTAATCCT GAGGCTGGTA GCAGAATGCA CATTGGAAGC TCCCACCCCA 1320 

TATTGTTCTT CAAAGTGGAG GTCTCCCXTTG ATCCAGACAA GTGGGAGAGC CCGTGGQGGC 1380 

J«3C3GGACCTG GAGCTGCCAG CAGCAAGCGT GATTCGTGGT GGGTGTATTC TGTATTCCAA 1440

TAAAGCAGAG TTTGACACCG TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA ATTNCTGCQG 1500 

CXTCAAGGG 1509 



(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
TCGACCCCAS GCGTCCGTGC TTTTCCACAG AAGGTTAGAC CCTGAAAGAG ATGGCTCAGC 60

ACCACCTATG GATCTTGCTC CTTTGCCTGC AAACCTGGCC GGAAGCAGCT GGAAAAGACT 120

CAGAAATCTT CACAGTGAAT GGGATTCTGG GAGAGTCAGT CACTTTCCCT GTAAATATCC 180

AAGAACCACG GCAAGTTAAA ATCATTGCTT GGACTTCTAA AACATCTGTT GCTTATGTAA 240

CACCAGGAGA CTCAGAAACA GCACCCGTAG TTACTGTGAC CCACAGAAAT TATTATGAAC 300

GGATACATGC CTTAGGTCCG AACTACAATC TX3GTCATTAG CGATCTGAGG ATGGAAGACG 360

CAGGAGACTA CAAAGCAGAC ATAAATACAC AGGCTGATCC CTACACCACC ACXAAGCGCT 420

ACAACCTCCA AATCTATCGT CGGCTTGGGA AACCAAAAAT TACACAGAGT TTAATX3GCAT 480

CTGTGAACAG CACCTGTAAT GTCACACTGA CATGCTCTGT AGAGAAAGAA GAAAAGAATG 540

TGACATACAA TTGGAGTCXrC CTGGGAGAAG AGGGTAATGT CCTTCAAATC TTCCAGACTC 600

CTGAGGACCA AGAGCTGACT TACACGTGTA CAGCCCAGAA CCCTGTCAGC AACAATTCTG 660

ACTCCATCTC TGCCCGGCAG CTCTGTGCAG ACATCGCAAT GGGCTTCCGT ACTCACCACA 720

CCGGGTTGCT GAGCGTGCTG GCTATGTTCT TTCTGCTTGT TCTCATTCTG TCTTCAGTGT 780

TTTTGTTCCG TTTGTTCAAG AGAAGACAAG ATGCTGCCTC AAAGAAAACC ATATACACAT 840

ATATCATGGC TTCAAGGAAC ACCCAGCCAG CAGAGTCCAG AATCTATGAT GAAATCCTGC 900

AGTCCAAGGT GCTTCCCTCC AAGGAAGAGC CAGTGAACAC AGTTrATTCC GAAGTGCAGT 960

TTGCTGATAA GATGGGGAAA GCCAGCACAC AGGACAGTAA ACCTCCTGGG ACTTCAAGCT 1020

ATGAAATTGT GATCTAGGCT GCTGGGCTGA ATTCTCCCTC TGGAAACTGA GTTACAACCA 1080

CCAATACTGG CAGGTTCCCT GGATCCAGAT CTTCTCTGCC CAACTCTTAC TGGGAGATTG 1140

CAAACTGCCA CATCTCAGCC TGTAAGCAAA GCAGGAAACC TTCTGCTGGG CATAGCTTGT 1200

GCCTAAATGG ACAAATGGAT GCATACCCTT CCTGAAATGA CTCCCTTCTG AATGAATGAC 1260

AAAGCAGCTT'' ACCTACT GTTITCCCAA ACTTCriTCCC ATX^ ' 1320

TAATATTTTT ATGGCACACT GGGATAAACA AGCAAGATTG CTCACTTCTG GAAGCTGCAT 1380

ATGACTAGAG GCCTCTTGTG ACTGGAGGTA ACAACCCTGC CCAGTAACTG TGGGAGAAGG 1440

GGATCAATAT TTTGCACACC TGTAATAGGC CATGGCACAC CAGCCAAGAT GCTCTGCTCA 1500

CAGTCAGTAT GTGTGAAGAT CCCTGGTGCG TGGCCTTCAC CACGCATCTT GAGCAAATTA 1560

GGAAAATGTA CCCTTCGCTT GAGGCAGATG CAGCCCTTCC CCCGAGTGCA TCGCTTGGAG 1620

AGCAGAATGT GGGCTGCATA TAAGCACACT CATCCCTTTG TCTGGGAATC TTTGTGCAGG 1680

GCATAACAGG CTTAGTAAGT CCAAACACAG ATGACAGTGC TGTGTGGGTC TCTGTCAGAG 1740

TTGTGGCTCT CAGCCATGTA GACACACTCT CCAAATGGAG TGTTGGAAAA TGTTCTTTCT 1800

GCAGGGTCTA GAGACTGCTG GGACACTTTT CTTGGAGTGC TACTTCAGAA GCCTTATAGG 1860

ATTTTCTTTC TQGCCAT^T TTCCTTCTGT ATCACTCCAA GCAGCCTCAG CAGAAGAAGC 1920

AGCCATGCCC AGTATTCCCA CTCTCCAAAA GGAACTGACC AGCTTATATT TCTCACACTT 1980

CTGGGGAACT GGGTATAATC CAACCATCAA AATAGAAGAC CTTGCAAGAA GCAGAGTCAT 2040

TCTCCAGAAG GAACTTGGGA GATGATGGTG CAGATGATGA AACTGGGTTC ATCCCAGTTC 2100 

CAAAGACTCA GAGAACTAGA GTTTAAGCTG AGGCAGAGTG CXTGCTACCCT GGCATGCCCC 2160 

ACAAACAGAT CACCAGCCAG CTTACACAGG CATTAACTCT CXTTCAATGAG GAAGAATCAT 2220 

TCACAACTGA GCAAGACATT CATATGATCA TTTAAGGAAG TGTTTCCCTT ATGTGTTAGC 2280 

AAGTATAATC GGCTAACTCC TAAATCCCAA TGAATAGTCC TAGGCTGGAC AGCAATGGGC 2340 

TGCAATTAGG CAGATAAAGA CATCAGTCCC AGTAAATGAA TCCATAGACT CATCTAGCAC 2400 

CAACTACCAT TAGCACTATG TTAGGAGCTG CAAGGCCCCA AAGTAGAAGA TGTGCATAAT 2460 

GTCTGCTCTT GTGTAGCTCA GGAGACAATT CCAGCACAGA CACTACAGOT AACGCTGAAC 2520 

TGCAGCTGCA AGTAATAGCA TGAACAGTCA GAAAAATACC TTATGAGGGG GCAGGGCTGA 2580 

AGCTGGGCCT TGAAGGATGG ATGAAATTTG GATAGAGAAT GAGGAAGACA GAGGGCCTCC 2640 

AAGTGAGAGA AGCATGAAAA ATGAGCAGGG GCCTGGATCA GTGGGGTGTA TTCAGAGCAC 2700 

CTCTCCAGAT GCACCATGCA TGCTCACAGT CCCTTGCCTA TGTGTGGCAG AGTGTCGCAG 2760 

CCAGATGTGT GCCCCCACCC CATGTCCATT TACATGTCCT TCAATGCCCA CCTCAAAAGG 2820 

TACCTCTTCT GTAAAGCTTT CCCTGGTATC AGGAATCAAA ATTAATCAGG GATCTTTTCA 2880 

CACTGCTGTT TTTTCCTCTT TGGTCCTTCT ATCACTAAAA CTCATCTCAT TCAGCCTTAC 2940 

AGCATAACTA ATTATTTGTT TTCCTCACTA CATTGTACAT GTGGGAATTA CAGATAAACG 3000 

GAAGCCKGCT GGGGTGGTGG CTCACGCCTG TAATCCCAAC ACTTTGGGAG GCCAAGGCAG 3060 

GCGGATCACC TGAGGTCAGG ARTTCGAGAT TARTCTGGCC AACATGGTGA AACCCCATOT 3120 

NTACTAAAAA TACGAAATTA GCCAGGTGTG GTGGCACACA TCTGTAGTCC CAG 3173 



(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATTCGGCA CAGCTGAGAG GAGACACAAG GAGCAGCCCG CAAGCACCAA GTGAGAGGCA 60 

TGAAGTTACA GTGTGTTTCC CTTTGGCTCC TGGGTACAAT ACTGATATTG TGCTCAGTAG 120 

ACAACCACGG TCTCAGGAGA TGTCTGATTT CCACAGACAT GCACCATATA GAAGAGAGTT 180 

TCCAAGAAAT CAAAAGAGCC ATCCAAGCTA AGGACACCTT CCCAAATGTC ACTATCCTGT 240

CCACATTGGA GACTCTGCAG ATCATTAAGC CCTTAGATGT GTGCTGCGTG ACCAAGAACX: 300

TCCTGGCGTT CTACGTGGAC AGGGTGTTCA AGGATCATCA GGAGCCAAAC CCCAAAATCT 360 

TGAGAAAAAT CAGCAGCATT GCCAACTCTT TCCTCTACAT GCAGAAAACT CTGCGGCAAT 420

GTCAGGAACA GAGGCAGTGT CACTGCAGGC AGGAAGCCAC CAATGCCACC AGAGTCATCC 480 

ATGACAACTA TGATCAGCTG GAGGTCCACX3 CTGCTGCCAT TAAATCCCTG GGAGAGCTCG 540

ACGTCTTTCT AGCCTGGATT AATAAGAATC ATGAAGTAAT GTCCTCAGCT TGATGACAAG 600 

GAACCTGTAT AGTGATCCAG GGATGAACAC CCCCTGTGCG GTTTACTGTG GGAGACAGCC 660 

CACCTTGAAG GGGAAGGAGA TGGGGAAGGC CCCTTGCAGC TGAAAGTCCC ACTGGCTGGC 720 

CTCAGGCTGT CTTATTCCGC TTGAAAATAG CCAAAAAGTC TACTGTGGTA TTTGTAATAA 780 

ACTCTATCTG CTGAAAGGGC CTGCAGGCCA TCCTGGGAGT AAAGGGCTGC CTTCCCATCT 840

AATTTATTGT GAAGTCATAT AGTCCATGTC TGTGATGTGA GCCAAGTGAT ATCCTGTAGT 900 

ACACATTGTA CTGAGTGGTT TTTCTGAATA AATTCCATAT TTTACCTAAA AAAAAAAAAA 960 

AAAAACTCGA GGGGGGGCCC GTACCCAATT T 991 



(2) INFORMATION FOR SEQ ID NO: 176: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

ACAGCCCTCT TCGGAGCCTG AGCCCGGCTC TCCTCACTCA CCTCAACCCC CAGGCGGCCC 60 

CTCCACAGGG CCCCTCTCCT GCCTGGACGG CTCTGCTGGT CTCCCCGTCC CCTGGAGAAG 120 

AACAAGGCCA TGGGTCQGCC CCTGCTGCTG CCCCTRCTGC YCCTGCTGCW GCCGCCAGCA 180

TTTCTGCAGC CTRGTGGCTC CACAGGATCT GGTCCAAGCT ACCTTTATGG GGTCACTCAA 240 

CCAAAACACC TCTCAGCCTC CATGGGTGGC TCTGTGGAAA TCCCCTTCTC CTTCTATTAC 300 

CCCTGGGAGT TAGCCAYAGY TCCCRACGTG AGAATATCCT GGAGACGGGG CCACTTCCAC 360

GGGCAGTCCT TCTACAGCAC AAGGCCGCCT TCCATTCACA AGGATTATGT GAACCGGCTC 420 

TTTCTGAACT GGACAGAGGG TCAGGAGAGC GGCTTCCTCA GGATCTCAAA CCTGCGGAAG 480

GAGGACCAGT CTGTGTATTT CTGCCGAGTC GAGCTGGACA CCCGGAGATC AGGGAGGCAG 540 

CAGTTGCAGT CCATCAAGGG GACCAAACTC ACCATCACCC AGGCTGTCAC AACCACCACC 600 

ACCTGGAGGC CCAGCAGCAC AACCACCATA GCCGGCCTCA GGGTCACAGA AAGCAAAGGG 660 

CACTCAGAAT CATGGCACCT AAGTCTGGAC ACTCCCATCA GGGTTCCATT GGCTCTCGCT 720 

GTGCTCAAAA CTCTCATTTT GGGACTCCTG TGCCTCCTCC TCTCTGC?rGG AGGAGAAGGA 780 

AAGGTAGCAG GGCX^CCAAGC AGTGACTTCT GACXAACAGA CTDGOtSGGGAG AAGC3QATCTO 840 

TATTAGCCCC GGAGGACGTG ATCTGAGACC CX3CTTCTCAG TCCTCCACAC TCGTIXXCCA 900 

TTCGCAAGAT ACATGGAGAG CACCCTGAGG ACCTTTAAAA GGCAAAGCCX5 CAAGGCAGAA 960 

GGAGGCTGGG TCCCTGAATC ACCGACTGGA GGAGAGTTAC CTACAAGAGC CTTCATCCAG 1020 

GAGCATCCAC ACTGCAATGA TATAGGAATG AGGTCTGAAC TCCACTGAAT TAAACCACTC 1080 

GCATTTGGGG GCTGTTYATT ATAGCAGTGC AAAGAGTTCC TTTATCCTCC CX^AAGGA'TOG 1140 

AAAATACAAT TTATTTTGCT TACCATACAC CCCTTTTCTC CTCGTCCACA 'TTrrCCAATX: 1200 

TGTATGGTGG CTGTCTTCTA TGGCAGAAGG TTTTOGGGAA TAAATAGCGT GANATGWTOC 1260 

TGACTOAAAA AAAAAAAAAA AAAAACTCGA 1290



(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2290 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

TGGGGCCCCT TTTGGATGCT CTGGGTGTTT TTGCCAAGAG TTACAGGATG TCAAGTGTGG 60

GGAGCTCAGC ACCCTTGCTG TGGACCAGTG AAGGCTOTTC CAGACCAGGT GCTTCCAGAC 120

ATTTCCAGGC TCCAGGAGAG AGGCTGGGAG CCCCCACAGA AAGCACAGGA AAATGCAAAA 180

AAAAAACAGT CTTTTTTTTT TTTTTGCTTT TTATTATCAA AACAAAACAA ATGCCCCAGG 240

AGAAGGGTCC ATGATTACCA GAAACATCAA AGAC?rACTTT CTACCATTTT TATTCTGTTG 300

TGTTGAGGCC AGCATTGCAA TAAACAAGCT AAACTACTTA CATTGGACTC ATTTTCAGTA 360

ACTGACATTT ACAGGAATAT ACTAGAAACG GCACTAAAAA GTTTAAGAAA AGTTACGGTA 420

AACTTGCATG CACATCATAC AGAAAAGTAA CATTPTAAAT ATAAAAAAGA AAAACTTCCT 480

GGAAGCATTA TGCCAGTATT AAGGAACAGT GCTACTCTGG ATGTGACAAA TTCTGTATGT 540

C5GGTGTTACT CTTTCCCAAA AGACTGTCAG AGGCGTCAGT GCTGCAAAAG AACAACAACA 600

AAAACAAACA CACAAAAAAA TGTGTCTTAC AGTTTGTAAG CAAGATGACA CTGCCCAACA 660

CAAAGAGGGG TCTGGAGTTC AGTTCACGCC CGAAGCCTGC CCCCTCGGCC TCCAGGGGTC 720

ATTCAGAGTG TTCTCAAATC CAATTCCGAC ACACGACTTG TCACTACTCC TCTCCCCTTG 780

AAAAAAGCAT GTTAGAAGCT GCCCTACAGG TCTCAGCAGT GGGACAATCT AATTGAATCA 840 

CCGCAGCCTT CTAATACAGA AGAAACGGAC GTGACTGTCA CCCTCAGCCC GCCAGCAAGG 900 

GCGCTGAGGA AGTCATTAAT CCTTCGAAAC TCTGAAAAGA AACCAGTGTT GAAGTCTCGA 960 

CAGAAAGCCT TAAAAAAGTG I^CAGCACCAA TGCAGCTGCT CAGTGTACCC NCCGTGGGCT 1020

GTCAGGGTCA GTGGCTTCTT TCTAGATGAA AGGAGCAGAG GCGAGCCGAC GCCACCGTCA 1080 

CAGAGAACCA GCXTGAGAAGG AAAGGCCCCA CGATGCTCCC TGTGCGCTGC CCCCACAGCX: 1140 

GGCCGCTCCC CCX3ACGGCTC ACACAGGCAG CACCTCACTG CCCTGTGGCT GGAGGGGCAT 1200 

TGCAAGGAGC GCCCCCXAGC CCCAGGCACC CCCGGCTTAG GGTGTACGTA TCACXTCAGCC 1260 

CTGTGCTGGC AGCACGTTAC CAACCAGCCT GCGTGAAGAC CTGTCAACTC TCGTGTGTGA 1320

ATTCCTTAAA TTCGGTTTAA ATAGTCCATT AAAGATCTGT TTAGAAAATA CCTTTGAAAA 1380 

CGAGGGTAAC TTTAAAAAAT GGAAACTTTC AAATCCATTT ATATTTTTAT TATAAACAAA 1440 

ACTTAATTAA AAGTTTAACA AACTGGCTGA AAACTCACCA AGTGTCAGAC TCACX:AGCAA 1500 

TTTAAAAAAT GATAATTTAC CAGCATCTCC TCATCAGAGT TCCCTCTCCA GTAAGGGTAT 1560 

ACCTACATCT GTAAGGGTCA GTGGACTCTG AATCAATTTT ATGGTTGTTT TAAAATCACC 1620

GTGTATTAGG ATACTAATGA TAGTCCCTAT ATCCATCCAG AAATGCTGGC AGAAAGCACT 1680 

GGCCACCATA CAGGACAGAC CACACCACAG CTCCATACCC AGCGTCTGCC TGGAGGCTCC 1740 

CCCACGCTGA GGTCCGGGAG AATGCCTGGT TTCAGTCATT TCCGGACTAA CTGTGACAAC 1800 

GCX3TGAGCAG GGAGCACCGT GCGAGTCTCC GGGAGGGAAT CCTCCTGGGG CCCAGAGACT 1860 

CCTCCACCCC TGGGGAGGGC AGACAGGCTC GGGARGGCCT GGCCAGGCCA CTGGAGGCTG 1920

GCAGGGAGCA GGCATGTCCA CCCGCAAGCC TGGGAGGCTA ACTCTGGCAT TCCTGGCCGG 1980 

AGCCGCCATG CTCATTGGTG GGCCAGTTTG GGACATCCCC GTACTCAAAG ACCATATGGC 2040 

AGCCTCTGGG AAAACAAAAC CAAAACATCA CCTTCTATTA AACTCTGTAT ATTATTATTT 2100 

TTTACAATAG AAAGTTAAAA ATCAAGACTT AGATTTACTA TACATTTTTT CTCTCAGATT 2160 

ACAAAGTTTA TATTATATAA CTGGGGTTCC CTAAATTGAT TTCTTTTAAA ACAGTCTTAA 2220

AGAGACCAGA AGTGAATACA AAAGAACTAA ACAAAATAAA AAATTAGAAT GTGCTGTAGC 2280 

TGAAAGCTCT 2290 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

GGCACGAGCC ATGCCTQGCC TCTCCTTGAT TCTTACAGTC ACTTTGTTGG CTGTTTCTGA 60 

CTCAGCAGCT ACCTGCATTG TGGCCAAAGG ATGACCTATT CCTTCTCAGG AGGGCAAAAA 120 

TGTGGAATAG TGTCTGTCCA TGCCTCTCCT CATGGGCTAC CACCTCTGCC ACCGTGGTTA 180 

ATCAGTAACA ACCAGGAGAG AAGCTGCTGG AACTGACCTC TGGGAACTCC CTGGGATGGT 240

TTGGTGCAGG AATGTAGTAG GCATACACGT GGTTGCGTGG ATCTGGGCCC TCCTGATGTG 300 

AGTAGAGAGG TAAAAGGCCA CCATCTCCTT GACCTCTGGG GAACTCATCC ACAAAGAAGA 360 

TGTTTCCAAG ATGCTTCTGA AGATTGCCTA AAAATAGCCG GTTTCCACCC CCGTGAATGC 420 

ATCCATTCTA GAATGCTCCT TCACCAGGAC CAGAGAACTG ATTTACAGAA GTGACATGAA 480 

AACATTCCAT CCCAGAATTT GCAGTAGCTC AAATTAAGTT TCTAGCTATT AAAAAGAAAA 540

AAAAAAAAA 549 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1509 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

GGCACGAGGG CTCATTCATT CCGCGCCGGG CCTGCCAGAC ACCTGCGCCC TTCTGCAGCC 60 

GCCCGCCGCA TCCGCCGCCG CAGCCCCCAG CATGTCGGGC CCAGACGTCG AGACGCCGTC 120 

CGCCATCCAG ATCTGCCGGA TCATGCGGCC AQATGATGCC AACGTGGCCG GCAATGTCCA 180 

CGGGGGGACC ATCCTGAAGA TGATCGAGGA GGCAGGCGCC ATCATCAGCA CCCGGCATTG 240 

CAACAGCCAG AACGGGGAGC GCTGTGTGGC CGCCCTGGCT CGTGTCGAGC GCACCGACTT 300

CCTGTCTCCC ATGTGCATCG GTGAGGTGGC GCATGTCAGC GCGGAGATCA CCTACACCTC 360 

CAAGCACTCT GTGGAGGTGC AGGTCAACGT GATGTCCGAA AACATCCTCA CAGGTGCCAA 420 

AAAGCTGACC AATAAGGCCA CCCTGTGGTA TGTGCCCCTG TCGCTGAAGA ATGTGGACAA 480 

GGTCCTCGAG GTGCCTCCTG TTGTGTATTC CCGGCANGAG CAGGAGGAGG AGGGCCGGAA 540 

GCGGTATGAA GCCCAGAAGC TGGAGCGCAT GGAGACCAAG TGGAGGAACG GGGACATCGT 600

CCAGCCAGTC CTCAACCCAG AGCCGAACAC TGTCAGCTAC AGCCAGTCCA GCTTGATCCA 660

CCTGGTGGGG CCTTCAGACT GCACCCTGCA CX3GCTTTGTG CACGGAGGTG TGACCATGAA 720 

GCTCATGGAT GAGGTCGCCG GGATCGTGGC TGCACGCCAC TGCAAGACCA ACATCGTCAC 780 

AGCTTCCX5TG GACXKXATTA ATTTTCATGA CAAGATCAGA AAAGGCTGCG TCATCACCAT 840 

CTCGGGACGC ATGACCTTCA CGAGCAATAA GTCCATGGAG ATCGAGGTOT TGGTGGACGC 900 

CGACCCTGTT GTGGACAGCT CTCAGAAGCG CTACCX3GGCC GCCAGTGCCT TCTTCACCTA 960 

CGTGTCGCTG AGCCAGGAAG GCAGGTCGCT GCCTGTGCCC CAGCTGGTCC CCGAGACCX5A 1020 

GGACGAGAAG AAGCGCTTTG AGGAAGGCAA AGGGCGGTAC CTGCAGATGA AGGCGAAC3CR 1080 

ACAGGGCCAC GCGGASCYTC AGCCCTAGAC TCCCTCCTCC TGCCACTGGT GCCTCGAGTA 1140 

GCXyVTGGCAA CGGGCCCAGT GTCCAGTCAC TTAGAAGTTC CCCCCTTGGC CAAAAACXTCA 1200 

ATTCACATTG AGAGCTGGTG TTGTCTGAAG TTTTCGTATC ACAGTGTTAA CCTGTACTCT 1260 

CTCCTGCAAA CCTACACACC AAAGCTTTAT TTATATCATT CCAGTATCAA TGCTACACAG 1320 

TGTTGTCCCG AGCGCCGGGA GGCGTTGGGC AGAAACCXTTC GGGAATGCTT CCGAGCACGC 1380 

TGTAGGGTAT GGGAAGAACC CAGCACCACT AATAAAGCTG CTGCTTGGCT GGAAAAAAAA 1440 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 

AGAAAAAAN 1509 



(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

AGCTGTATCA TAGGAAAGAT GGCCACACCG GCGGTACCAG TAAGTGCTCC TCCGGCCACG 60 

CCAACCCCAG TCCCGGCGGC GGCCCCAGCC TCAGTTCCAG CGCCAACGCC AGCACCGGCT 120 

GCGGCTCCGG TTCCCGCTGC GGCTCCAGCC TGCATCCTCA GACCCTGCQG CAGCAGCGGC 180 

TGCAACTGCG GCTCCTGGCC AGACCCCGGC CTCAGCGCAA NTCCAGCGCA GACCCCAGCG 240 

CCCGCTCTGC CTGGTCCTGC TCTTCCAGGG CCCTTCCCCG GCGGCCGCGT GGTCAGGCTG 300 

CACCCAGTCA TTTTGGCCTC CATTGTGGAC AGCTACGAGA GACGCAACGA GGGTGCTGCC 360 

CGAGTTATCG GGACCCTGTT GGGAACTGTC GACAAACACT CAGTGGAGGT CACCAATTGC 420 
TTTTCAGTGC CGCACAATGA GTCAGAAGAT GAAGTGGCTG TTGACATGGA ATTTGCTAAG 480

AATATGTATG AACTGCATAA AAAAGTTTCT CCAAATGAGC TCATCCTGGG CTGGTACGCT 540

ACGGGCCATG ACATCACAGA GCACTCTGTG CTGNATCCAT GAGTACTACA GCCGAGAGGC 600

CCCCAACCCC ATCCACCTCA CTGTGGACAC AAGTCTCCAG AACGGCCGCA TGAGCATCAA 660

AGCCTACX5TC AGCACTTTAA TGGGAGTCCC OXSGGAGGACC ATGGGAGTGA TGTTCACGCC 720

TCTGACAGTG AAATACGCGT ACTACGACAC OXSAACGCATC GGAGTTGACC TGATCATGAA 780

GACCTGCTTT AGCCCCAACA GAGTGATTGG ACTCTCAAGT GACTTGCAGC AAGTAGGAGG 840

GGCATCAGCT CGCATCCAGG ATGCCCTGAG TACAGTGTTG CAATATGCAG AGGATGTACT 900

GTCTGGAAAG GTGTCAGCTG ACAATACTGT GGGCCGCTTC CTGATGAGCC TGGTTAACCA 960

AGTACCX3AAA ATAGTTCCCG ATGACTTTGA GACCATGCTC AACAGCAACA TCAATGACCT 1020

TTTGATGGTG ACCTACCTGG CCAACCTCAC ACAGTCACAG ATTGCACTCA ATGAAAAACT 1080

TGTAAACCTG TGAATGGACC CCAAGCAGTA CACTTGCTGG TCTAGGTATT AACCCCAGGA 1140

CTCAGAAGTG AAGGAGAAAT GGGTTTTTTG TGGTCTTGAG TCACACTGAG ATAGTCAGTT 1200

GTGTGTGACT CTAATAAACG GAGCCTACCT TTTGTAAATT AAAAAAAAAA AAAAAAACCN 1260

SGRGGGGGGG CCCGGTCCCA TTSSCCCTTT NGTAATTCGT NTTACAATCC CCNGGC 1316






(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 777 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

GGCATGWKCA GACATGACTT CTATTGCCAG GCTGGTCAAG TGGCAGGGTC ATGAGGGAGA 60 

CATCGATAAG GGTGCTCCTT ATGCTCCCTG CTCTGGAATC CACCAGCGGG CTATCTGCGT 120 

TTATGGGGCT GGGGACTAGA ATTGGATGCT TCAAAACCAT CACCTGTTGG CCAACAAGTT 180 

50 TGACCCAAAG GTAGATGATA ATGCTCTTCA GTGCTTAGAA GAATACCTAC GTTATAAGGG 240 

CCATTCTATT GGGACCTGAA CTTTGAAGAC CACAMTATTG AAGAGGCGTT GCTTACCYGT 300 

TGGGGGCCAA GAGGCATGTT ACCAAACATG GYYCARGAAM YTTGGYKGGG AMCARKEOCKG 360 

55 

GKKGGGARRM CMRGGGYTTG SCAAWTTCSK KGGCMWCCYT TTAGGGTAAR RRGGGCKGTO 420 

ATTAGATTGT GGGTAAAGTA GGATCTTTTG CCCTTGCAAA TTTGCTGCCT GGGTGAATGY 480 

TGCTTGTTCC TTCTCMACCC CTAACCCTAG TAGTTCCTCC ACTAACTTTC TCACTAAGTG 540

AGAATGAGAA CTGCTGTGAT AGGGAGAGTG AAGGAGGGAT ATGTGGTAGA GCACTTGATT 600

TCAGTTGAAT GCCTGCTGGT AGCTTTTCCA TTCTGTGGAG CTGCCGTTCC TAATAATTCC 660 

AGGTTTGGTA GCGTGGAGGA GAACTTTGAT GGAAAGAGAA CCTTCCCTTC TGTACTGTTA 720 

ACTTAAAAAT AAATAGCTCC TGATTCAAAG TAAAAAAAAA AAAAAAAAAA AAAAAAA 777 




(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 791 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
GGCACAGATA ACTATGTACA TGTATTCCTT AAATGTTTTT TTAAGTTTTA TATTCTTGGC 60
ACTGGTCTTC AAATGTGTAG ATGTGTGCCA GGGAGCAAAT GCCTTCTTGT TTCTGAAATT 120
GGTCTTTTAG ACTGTTCTTT TTTCCCATCT TCTCACCTCC TGCCCCTCCT TCAGGGTACT 180
TCCGTGGCCA GAACCCCTCC AGGTCAGAGG CAGAAGAGAA GCCTCATGGG TCACAGCAGC 240
AGATGTGGGC TGGAGATCTA TTCATTTGGT TTTGGCTTGA ATTTTCTGRA TGGTTTACTT 300
GATCYTGGGA AAGANATATC TTGCCAGGAA AAATGATAGN CCTTGACAAT GTTGAATGAT 360
CCTGCACCAC CTTGAAAGAC ATTTCTAATA TGGTTTGTCA GGCAAAGTGG TTAGTAGTCA 420
TTTGTGGCCT GAGGTATSAAG TCCTCffiAAS TCAGCAGACT T^CTGATAA'AATCCTC^ 480
TGCCCCTGGA CTGGGCTCTG TGAGAGTGGC CTTCTGCACT GTGCACAGTA GGTGTGAACA 540
CACCACACCT ACAGGGACCA CGTQGTQGGC TGTGGACTAG CGGCCAAGCT CCCTGCAGGC 600
CCACTAATAG AATTCAGCTT TTAGCATGGG CTGTTTCATA CTGTTCTGAT GAAACTGATT 660
TGGTTTCTTT CCTCGATACC CCTTCTGCAT TTCAGTGTTT TTGTTTAGTT TTCCTGGTTT 720
TTAATTATAA CTACAAAATA AAATCTTTAG GCTATTCACC TTAGCTTAGT AAAAAAAAAA 780
AAAAAAAACT C 791



(2) INFORMATION FOR SEQ ID NO: 183: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

AAATTGATTA ACAGCTTGAA AGAAGGCTCT GGTTTTGAAG GCCTAGATAG CAGCACTGCC 60 

AGTAGCATGG AGCTGGAAGA ACTTCGGCAT GAGAAAGAGA TGCAGAGGGA GGAAATACAG 120 

AAGCTGATGG GCCAGATACA TCAGCTCAGA TCCGAATTAC AGGATATGGA GGCACAGCAA 180 

10 GTTAATGAAG CAGAATCAGC AAGAGAACAG TTACAGGWTC TGCATGACCA AATAGCTGQG 240 

CAGAAAGCAT CCAAACAAGA ACTAGAGACA GAACTGGAGC GACTGAAGCA GGAGTTCCAC 300 

TATATAGAAG AAGATCTTTA TCGAACAAAG AACACATTGC AAAGCAGAAT TAAAGATCGA 360 

15 

GACGAAGAAA TTCAAAAACT CAGGAATCAG CTTACCAATA AAACTTTAAG CAATAGCAGT 420 

CAGTCTGAGT TAGAAAATCG ACTCCATCAG CTAACAGAGA CTCTCATCCA GAAACAGACC 480 

20 ATGCTGGAGA GTCTCAGCAC AGAAAAGAAC TCCCTGGTCT TrCAACTGGA GCGCCTCGAA 540 

CAGCAGATGA ACTCCGCCTC TGGAAGTAGT AGTAATGGGT CTTCGATTAA TATGTCTGGA 600 

ATT6ACAATG GTGAAGGCAC TCGTCTGCGA AATGTTCCTG TTCTTTTTAA TGACACAGAA 660 

25 

ACTAATCTGG CAGGAATGTA CGGAAAAGTT CGCAAAGCTG CTAGTTCAAT TGATCAGTTT 720 

AGTATTCGCC TGGGAATTTT TCTCCGAAGA TACCCCATAG CGCGAGTTTT TGTAATTATA 780 

30 TATATGGCTT TGCTTCACCT CTGGGTCATG ATTGTTCTGT TGACTTACAC ACCAGAAATG 840 

CACCACGACC AACCATATGG CAAATGAACC AAGCCCAGTT GTTGCAGTGA TTGGTTGTCT 900 

TTTTCTAGAC TTGGGATCTG CAAGAAQGCC AATTGCCTAA AATTTCTGAG AACAGTGCAC 960 

35 

. AAGATTATTT. TATCACTACA^GCITTTAAC_.TTTTTAAGTT_J^ J. 02 a 

AAATCTTCCA ATTTCCTTTA AATGGTAAGA GTTTCTAAAA CAGACAATAA TTTAACAAGC 1080 

40 TCAGCTCTGC TTTATCTGAG TTTAGTGGTC CTAATATATA TGTAGAGAAA GATGGTGGGG 1140 

TTGTTCACCT CTGTACAGAC CATCTGTATG TTAGGTGACA TTGATTATGG GTTATAATCA 1200 

GQGAAACTAA TTGTATTTAG TGACAAAAAT AAAAAGTTTT TTTTTTATAA TTCAGTCTGC 1260 

45 

TTTTGGATTT TCATATATTT AACTTTGCAA AAAGATTTAC TTTGTACATG TTACAGGCTT 1320 

GATTGGTGTA AATCTTTTTA TAAATACATA AATAAAAGNA AAATATGCAT TTTTCTTTTC 1380 

50 TAAAAAAAAA AAAAAAAAAA CTCGA 1405 



(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

GTCATGCAGT GCGCCGGAGA ACTGTGCTCT TTGAGGCCGA CGCTAGGGGC CCGGAAGGGA 60

AACTGCGAGG CGAAGGTGAC CGGGGACCGA GCATTTCAGA TCTGCTCGGT AGACCTGGTG 120 

CACCACCACC ATGTTGGCTG CAAGGCTGGT GTGTCTCCGG ACACTACCTT CTAGGGTTTT 180 

10 

CCACCCAGCT TTCACCAAGG CCTCCCCTGT TGTGAAGAAT TCCATCACGA AGAATCAATG 240 

GCTGOTAACA CCTAGCAGGG AATATGCCAC CAAAACAAGA ATTGGGATCC GGCGTGGGAG 300 

15 AACTGGCCAA GAACTCAAAG AGGCAGCATT GGAACCATCG ATGGAAAAAA TATTTAAAAT 360 

TGATCAGATG GGAAGATGGT TTGTTGCTGG AGGGGCTGCT GTTGGTCTTG GAGCATTGTG 420 

CTACTATGGC TTGGGACTGT CTAATGAGAT TGGAGCTATT GAAAAGGCTG TAATTTGGCC 480 

20 

TCAGTATGTC AAGGATAGAA TTCATTCCAC CTATATGTAC TTAGCAGGGA GTATTGGTTT 540 

AACAGCTTTG TCTGCCATAG CAATCAGCAG AACGCCTGTT CTCATGAACT TCATGATGAG 600 

25 AGGCTCTTGG GTGACAATTG GTGTGACCTT TGCAGCCATG GTTGGAGCTG GAATGCTGGT 660 

ACGATCAATA CCATATGACC AGAGCCCAGG CCCAAAGCAT CTTGCTTGGT TGCTACATTC 720 

TGGTGTGATG GGTGCAGTGG TGGCTCCTCT GACAATATTA GGGGGTCCTC TTCTCATCAG 780 

30 

AGCTGCATGG TACACAGCTG GCATTGTGGG AGGCCTCTCC ACTGTGGCCA TGTGTGCGCC 840 

CAGTGAAAAG TTTCTGAACA TGGGTGCACC CCTGGGAGTG GGCCTGGGTC TCGTCTTTGT 900 

35 GTCCTCATTG GGATCTATGT TTCTTCCACC TACCACCGTG GCTGGTGCCA CTdTTACTC 960 

AGrGGCAATG-TACGGTOGAT "T/2?ITCTTTT~CAC5CATGTTC"CTTCrc^^ "ATACCCAGAA 102^ 

AGTAATCAAG CGTGCAGAAG TATCACCAAT GTATGGAGTT CAAAAATATG ATCCCATTAA 1080 

40 

CTCGATGCTG AGTATCTACA TGGATACATT AAATATATTT ATGCGAGTTG CAACTATGCT 1140 

GGCAACTGGA GGCAACAGAA AGAAATGAAG TGACTCAGCT TCTGGCTTCT CTGCTACATC 1200 

45 AAATATCTTG TTTAATGGGG CAGATATGCA TTAAATAGTT TGTACAAGCA GCTTTCGTTG 1260 

AAGTTTAGAA GATAAGAAAC ATGTCATCAT ATTTAAATGT TCCGGTAATG TGATGCCTCA 1320 

GGTCTGCCTT TTTTTCTGGA GAATAAATGC AGTAATCCTC TCCCAAATAA GCACACACAT 1380 

50 

TTTCAATTCT CATGTTTGAG TGATTTTAAA ATGTTTTGGT GAATGTGAAA ACTAAAGTTT 1440 

GTGTCATGAG AATGTAAGTC TTTTTTCTAC TTTAAAATTT AGTAGGTTCA CTGAGTAACT 1500 

55 AAAATTTAGC AAACCTGTGT TTGCATATTT TrTKGGAGTG CAGMOTAV/TG TAATTARAGC 1560 

ATTCCAGTAA NAGTGTNTTT AAAGTTGmC TATATN 1596 

(2) INFORMATION FOR SEQ ID NO: 185:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2293 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

GCGCAGAGCC CGYACGAGCA GGACGACGAC GACAAGGGCG ACTCCAAGGA AACGCGGCTG 60 

ACCCTGATGG AGGAAGTGCT CCTGCTGGGC CTCAAGGACC GCGARGGTTA CACATCATTT 120 

15 

TGGAATGACT GTATATCATC TGGATTACGT GGCTGTATGT TAATTGAATT AGCATTGAGA 180 

GGAAGGTTAC AACTAGAGGC TTGTGGAATG AGACGTAAAA GTCTATTAAC AAGAAAGGTA 240 

20 ATCTGTAAGT CAGATGCTCC AACAGGGGAT GTTCTTCTTG ATGAAGCTCT GAAGCATGTT 300 

AAGGAAACTC AGCCTCCAGA AACGGTCCAG AACTGGATTG AATTACTTAG TGGTGAGACA 360 

TGGAATCCAT TAAAATTGCA TTATCAGTTA AGAAATGTAC GGGAACGATT AGCTAAAAAC 420 

25 

CTGGTGGAAA AGGGTGTATT GACAACAGAG AAACAGAACT TCCTACTTTT TGACATGACA 480 

ACACATCCCC TCACCAATAA CAACATTAAG CAGCGCCTCA TCAAGAAAGT ACAGGAAGCC 540 

30 GTTCTTGACA AATGGGTGAA TGACCCTCAC CGCATGGACA GGCGCTTGCT GGCCCTCATT 600 

TACCTGGCTC ATGCCTCGGA CGTCCTGGAG AATGCTTTTG CTCCTCTTCT GGACGAGCAG 660 

TATGATTTGG CTACCAAGAG AGTGCGGCAG CTTCTCGACT TAGACCCTGA AGTGGAATGT 720 

35 

CTGAAGGCCA- ACACCAATGA-GGTTGTGTGG <5C<X?PGGTGG ^^GGGGTTGAG-GAASTAACTG 780- 

TGCTCGGGGT GAACCATTCT CCTTTCTCTC AAGTAAACCA GTAGTTTTTC TTCTGTTGAC 840 

40 TTCTGGTTTT CTGTAATTTG TACTTTCCCA CACTATAATT GGCTTCTGTT TTACAAAATG 900 

GTGGGTQGCT TTTTCTITTT TGTACGTGTA CAGGATTCTG CTGGTACGAG AGGCCTTCCT 960 

CTTTCTGTTT TTAAAAAAAG TTTTACTGCC ATATTGGCAT TCCATTCCCT GTTGCCATCC 1020 

45 

TCACTGTTAC CTGTTTTGGG TTTCTGGTCT ACTTTGACTT TCAAAGTACC TCCAGCCTCC 1080 

TCATACGCAC AGCTTTTGGA TGACCTCAGC TTGAGTTTCT CCATATGTGC ATGTACATCT 1140 

50 AGCATTCTGC CTACAGTTCA GACAGAAGTC ACAAAAAGGC CTTCAACTCA CCAAAGGTAA 1200 

ATATCTGTAT CTATTAGGAC ATTTTTTACA TAGACTTCAG TTGAGATGTA TACTTAGCAA 1260 

AATTATTTTT AAATTGAAAC AGCACAGTAA ATACTTAATA TAAAATGTCC CTTGGATTTT 1320 

55 

GCTTCCCATG TAAATCTATT GTATTATTAC ACTTGTTATA ATTTTAACTA TAAAGGTCCA 1380 

ATTGTTTCAC AGAGCCAGTT TGGGATGGGC TGCATTCCAT TTATGCTCTA TATAGTTTGA 1440 

ATTATATATA AATTACCCCT TCTTCTGGCC ACCCCTGCTC CCATCTTAGT ATTTTGCAAG 1500

ATCTAATCAG TTGTACACCT GGTGCCCCTC GCTTGCTTCA ATCATGGTTA TTTGATGGCA 1560

AAATCXaACCT CTTGTCGCTG AAGGAGAGAG AAAAGATGTG TGTCTGATTG GTCCTCGGAT 1620 


TTTTTGAGCT GTGCCATTTA TGGTACTCTT TGCXTTATGCA TCCCCTTTTT AGATTTTTTT 1680 

TAAATTTTAT CTTACIOTTT TTATAATTTC TATTGGGAAG AGGCTTGTGA CCAGrTACCAA 1740 

10 TCTTGAGTTT CTTTTTCTGT CCACAAGTAA ATTAATATCT GCTCTGAAAT GTCATTTATC 1800 

TACTCACACA TTCTTGGGGA AAAAAATCAA ATGTCAGTCC TAGCAGATGT TGCATGTAAA 1860 

TTGGTAGCAA GTAATGATTA CAACCCAGAG GATTAAGAAT TTTGTAACAG AAAGCTCTAT 1920 

15 

GTTTTAATTT TTTATATACA ATTAGGATAA TTAGCATTGT CAGACTATAA ACCTTTGCTT 1980 

TTTAAAGTTT ATTTTTACTA TTTCTTTATC ACTTTATTGT ATCATCACCA TTGGTrTCAT 2040 

20 AATGTAAATA CTATATGTTG AACAAATTAA ATGTCAAAAT TTTTTATTAC CATAGTCCAT 2100 

GTTAATAGTG GGGCTTTCAG GTGTTTAGAG ATTTITrrTG TTGTTGTTAA CATTCATTGC 2160 

AAAAGTACTA GATGGTGTAT AACTCTAGAG 1TGAATTTTA AGGGATTCCX: TAATATGTAT 2220 

25 

ACTATCTTTT TATCTGAAGT AATAAATAAA CAATGATCTT GAAAGTGCCY RAAAMAAAAA 2280 

AAAAAAAAAA AAA 2293 

30 



(2) INFORMATION FOR SEQ ID NO: 186:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1212 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
GGCACGAGGC GAGCCGGCGC ACCGTACGCT GGGACGTGTG GTTTCAGCTC GTGCGCCTCC 60 
45 CCGTGGGTTT GCGACGTTTA GCGACTATTG CGCCTGCGCC ACGCCGGCTG CGAGACTGGG 120 
GCCGTGGCTG CTGGTCCCGG GTGATGCTAG GCGGCTCCCT GGGCTCCAGG CTGTTGCGQG 180 
GTGTAGGTGG GAGTCACGGA CGGTTCGGGG CCCGAGGTGT CCGCGAAGGT GGCGCACATG 240 

50 

GGCGGCAGGG GAGAGCATGG CTCAGCGGAT GGTCTGGGTG GACCTGGAGA TGACAGGATT 300 
GGACATTGAG AAGGACCAGA TTATTGAGAT GGCCTGTCTG ATAACTGACT CTGATCTCAA 360 
55 CATTITGGCT GAAGGTCCTA ACCTGATTAT AAAACAACCA GATGAGTTGC TGGACAGCAT 420 
GTCAGATTGG TGTAAGGAGC ATCACGGGAA GTCTGGCCTT ACCAAGGCAG TGAAOGAGAG 480 
TACAATTACA TTGCAGCAGG CAGAGTATGA ATTTCTGTCC TTTGTACGAC AGCAGACTCC 540 

TCCAGGGCTC TGTCCACTTG CAGGAAATTC AGTTCATGAA GATAAGAAGT TTCTTGACAA 600

ATACATGCCC CAGTTCATGA AACATCTTCA TTATAGAATA ATTGATGTGA GCACTGTTAA 660 

AGAACTGTGC AGACGCTGGT ATCCAGAAGA ATATGAATTT GCACCAAAGA AGGCTGCTTC 720

TCATAGGGCA CTTGATGACA TTAGTGAAAG CATCAAAGAG CTTCAGTTTT ACCGAAATAA 780 

CATCTTCAAG AAAAAAATAG ATGAAAAGAA GAGGAAAATT ATAGAAAATG GGGAAAATGA 840 

10 

GAAGACC3GTG AGTTGATGCC AGTTATCATG CTGCCACTAC ATCGTTATCT GGAGGCAACT 900 

TCTGGTGGTT TTTTTTTCTC ACGCTGATGG CTTGGCAGAG CACCTTCGGT TAACTTGCAT 960 

15 CTCCAGATTG ATTACTCAAG CAGACAGCAC ACGAAATACT AT TTTTCTCC TAATATGCTG 1020 

TTTCCATTAT GACACAGCAG CTCCTTTGTA AGTACCAGGT CATGTCCATC CCTTGGTACA 1080 

TATATGCATT TGCTTTTAAA CCATTTCTTT TGTTTAAATA AATAAATAAG TAAATAAAGC 1140 

20 

TAGTTCTATT GAAATGCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1200

AAAAAAAAAA AN 1212 

25 



(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:



GCTTCCGGAA GTTGCTTTTG TCCAAACATC CGGGCTTCTC CTTTTTGTGT TCCGGCCGAT 60 

40 CCCACCTCTC CTCGACCCTG GACGTCTACC TTCCGGAGGC CCACATCTTG CCCACTCCGC 120 

GCGCGGGGCT AGCGCGGGTT TCAGCGACGG GAGCCCTCAA GGGACATGGC AACTACAGCG 180 

GCGCCGGCGG GCGGCGCCCG AAATGGAGCT GGCCCGGAAT QGGGAGGGTT CGAAGAAAAC 240 

45 

ATCCAGGGCG GAGGCTCAGC TGTGATTGAC ATGGAGAACA TGGATGATAC CTCAGGCTCT 300 

AGCTTCGAGG ATATGGGTGA GCTGCATCAG CGCCTGCGCG AGGAAGAAGT AGACGCTGAT 360 

50 GCAGCTGATG CAGCTGCTGC TGAAGAGGAG GATQGAGAGT TCCTGGGCAT GAAGGGCTTT 420 

AAGGGACAGC TGAGCCGGCA GGTGGCAGAT CAGATGTGGC AGGCTGGGAA AAGACAAGCC 480 

TCCAGGGCCT TCAGCTTGTA CGCCAACATC GACATCCTCA GACCCTACTT TGATGTQGAG 540 

55 

CCTGCTCAGG TGCGAACAGG GCTCCTGGAG TCCATGATCC CTATCAAGAT GGTCAACTTC 600 

CCCCAGAAAA TTGCAGGTGA ACTCTATGGA CCTCTCATGC TGGTCTTCAC TCTGGTTGCT 660 

ATCCTACTCC ATGQGATGAA GACGTCTGAC ACTATTATCC GGGAGGGCAC CCTGATQGGC 720

ACAGCCATTG GCACCTGCTT CGGCTACTGG CTGGGAGTCT CATCCTTCAT TrACTTCCTT 780

GCCTACCTGT GCAACGCCCA GATCACCATG CTGCAGATGT TGGCACTGCT GGGCTATGGC 840 


CTCTTTGGGC ATTGCATTGT CCTGTTCATC ACCTATAATA TCCACCTCCA CGCCCTCTTC 900 

TACCTCTTCT GGCTGTTGGT GGGTGGACTG TCCACACTGC GCATGGTAGC AGTGTTGGTG 960 

10 TCTCGGACCG TGGGCCCCAC ACAGCGGCTG CTCCTCTGTG GCACCCTGGC TGCCCTACAC 1020 

ATGCTCTTCC TGCTCTATCT GCATTTTGCC TACCACAAAG TGGTAGAGGG GATCCTGGAC 1080 

ACACTGGAGG GCCCX:AACAT CCCGCCCATC CAGAIGGGTCC CCAGAGACAT CCCTGCCATG 1140 

15 

CTCCCTGCTG CTCGGCTTCC CACCACCX5TC CTCAACX3CCA CAGCCAAAGC TGTTGCGGTG 1200 

ACXrCTGCAGT CACACTGACC CCACCTGAAA TTCTTGGCCA GTCCTCTTTC CXXX:AGCTGC 1260 

20 AGAGAGGAGG AAGACTATTA AAGGACAGTC CTGATGACAT GTTTCGTAGA TGGGGTTTGC 1320 

AGCTGCX:ACT GAGCTGTAGC TGCGTAAGTA CCTCCTTGAT GCNTGTCGGC ACTTCTGAAA 1380 

GGCACAAGGC CAAGAACTCC TGGCCAGGAC TGCAAGGCTC TGCAGCCAAT GCAGAAAATG 1440 

25 

GGTCAGCTCC TTTGAGAACC CCTCCCCACC TACCCCITCC TTCCTCTTTA TCTCTCCX:AC 1500

ATTGTCTTGC TAAATATAGA CTTGGTAATT AAAATGTTGA TTGAAGTCTG GAAAAAAAAA 1560

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAG 1605



(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1516 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

45 ATTCGGCATG AGGGGGTCAC GTGGTGGCTG GGCCGGGGAA ATGGCGGCTT CAGGAGAGAG 60 

CGGGACTTCA GGCGGCGGAG GCAGCACCGA GGAAGCATTT ATGACCTTCT ACAGTGAGGT 120 

GAAACAAATA GAGAAGAGAG ACTCGGTTCT AACTTCGAAA AATCAGATTG AAAGACTGAC 180 

50 

CCGTCCTGGT TCCTCTTACT TCAATTTGAA CCCATTTGAG GTTCTTCAGA TAGATCCTGA 240 

AGTTACAGAT GAAGAAATAA AAAAGAGGTT TCGGCAGTTA TCCATCTTGG TGCATCCTGA 300 

55 CAAAAATCAA GATGATGCTG ACAGAGCACA AAAGGCTTTT GAAGCTGTGG ACAAAGCITA 360 

CAAGTTGCTA CTGGATCAGG AGCAAAAGAA GAGGGCCCTG GATGTAATTC AGGCAGGAAA 420 

AGAATACGTG GAACACACTG TGAAAGAGCG AAAAAAACAA TTAAAGAAGG AAGGAAAACC 480 

TACAATTGTA GAGGAGGATG ATCCTGAGCT GTTCAAACAA GCTGTATATA AACAGACAAT 540

GAAACTCTTT GCAGAGCTGG AAATTAAAAG GAAAGAGAGA GAAGCCAAAG AGATGCATGA 600 

AAGGAAACGA CAAAGGGAAG AAGAGATTGA AGCICAAGAA AAAGCCAAAC GGGAAAGAGA 660

GTGGCAGAAA AACTTTGAGG AAAGTCGAGA TGGTCGTGTG GACAGCTGGC GAAACTTCCA 720 

AGCCAATACG AAGGGGAAGA AAGAGAAGAA AAATCGGACC TTCCTGAGAC CACCGAAAGT 780 

10 

AAAAATQGAG CAACGTGAGT GACCGCCCAA GGTCACAGGC ACAGAACCTT TCCCCTGCTA 840 

TCTCCCTTCC TGCTTCGAAG GACTCATTCT TTCCTCCCAC TTCCACCCCA ACATAGAGTA 900 

15 GTATTTGCTT TTTAGTCCAT TTTGTTTTCA ATACGATTTA ATATCGATCA GAGTAATTCT 960 

TTTGTACATT GAAATGAGGG GCTTGGTTTA AAAAAAGACC TTTCCCTCTC CCTGCCXXTTA 1020 

GAACAACCAG TATTAGAAGG TGCCACCATT GGTGCTGCCT TCTCTTCCCA CAGCCTGTAA 1080 

20 

CTCAGTGTTT TGTACTTCAC TGAATTGTGA TGGTTAGAAA CTTCGTGGAT AGTTTGTGGA 1140 

AATCATCCAA TTAAACATAC TGCTTAAAAC AGTGTTGCTG TGACTTCAGA GACAAGCCTG 1200 

25 GAAGGGGCAC CTTAGGAAGC CCCTTCGCTT CAGTTGCTCG CTTCTGGGTG TGCTCCCTTC 1260 

GAAGGCCCAG ATAAGACAGG GAACACTTGT GAGCACACAG AGCAGCATCT GATGCCCTGT 1320 

GGTCTTTGGC ATGTGCCCCC TGTCTACTGA CCAATCAGTG TGGCATGAGG CCCACGCCAC 1380 

30 

CCAAACCTTT CACTTTCCAA AGAGCTAGCC GTCCTCCACC CAGTACCATG TCCTAGCCTG 1440 

TCTGCATTTG TTAGTGGTAA TATTCTTTAT GTATAATAAA TTTTTATACC CAAAAAAAAA 1500 

35 AAAAAAAAAA ACTCGA 1516 



(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 681 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:



50 GCTCCCATGT TGCTGGCTGT CCGTACATCA CCCTGTCCCC TGCAGGAGGG GGCTACAGGC 60 

CATCTCCCTC CTGTAGGCCT CTGACTCCCC TCCACTTTTG GGCCCTCAGC TTATCTCGGG 120 

CAGGGGACCA TTGCAGCATC CTCCCCTCCT CNGGACTCAA GGTGCTGAGG TATAAGCCCT 180 

55 

GGGCCCCAGA TCCCTGRTKA CACCTTCCTG GAGAAGACTC TCAAAAGTGA CTGTATATTT 240 

GAGTTCACCA GCAATAACTC CCCACACTCG AAGCAGGTCC AAACCCMAGG ATCCCAGGGT 300 

60 CCTTGGGCTC TGTGGCACTG TCTTCCCAAG ATCCTTCCTG TTGCACAATG GGAAACCTAA 360 



WO 98/39448



PCT/US98/04493 






GAGGAAAAAG ACAGGGGCCT GCTTGCCCAG CXATGCGAGG GATTCCATGC CCACCTGCCX: 420 

TCTGYCTGCC TCGCTGGAAT GTGGGCCCCT GCTCCCCGTC AGGTTGTGCT GTCTCTGACC 480 

5 

TATGTTTACA TCCCCGAGGG GTTTCTGCCT CXTTCCCCACC CAGGTCAGGG TGTGGTCCAG 540 

CAGCTTGCTG TGGGGTGCTG ACATGTGTCA CCACTGCCCC CCTTGCCCCC GGGGGGGTCA 600 

10 TGGTCTCCTC CTGGATGCTG CTCCTTGAAT YTTTTTTYTT GAWAAACCYT TTAMAATTAA 660 

AAAAAAAAAA AAAAAACTCG A 681 



15 



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1014 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

GCCTCAAGCC ACGCATATGA TAATTTTCTG GAACATTCAA ATTCAGTGTT TCTACAGCCA 60 

GTTAGTCTAC AAACCATTGC AGCAGCACCA TCAAACCAGA GTCTGCCACT TTTTGTCATC 120 

30 

GCTGGATGAT TGCTGGGCAA AGGTGGCCTT TTAGAGCTCT TAAAAGCCCA CAAAAAGGCT 180 

ATTCGTAGAG CCACAGTCAA CACATTTGGT TATATTGCAA AGGCCATTGG CCTCATGATG 240 

35 TATTGGCTAC ACTTCTGAAC AACCTCAAAG TTCAAGAAAG GCAGAACAGA GTTTGTACCA 300 

CTGTAGCAAT "AGCTATTGTT-GCAGAAACAT" GTTCACCCTT" TACJU^ "3^0" 

TGAATGAATA CAGAGTTCCT GAACTGAATG TTCAAAATGG AGTGTTAAAA TCGCTTTCCT 420 

40 

TCTTGTTTGA ATATATTGGT GAAATGGGAA AAGACTACAT TTATGCCGTA ACACCGTTAC 480 

TTGAAGATGC TTTAATGGAT AGAGACCTTG TACACAGACA GACGGCTAGT GCAGTGGTAC 540 

45 AGCACATGTC ACTTGGGGTT TATGGATTTG GTTGTGAAGA TTCGCTGAAT CACTTGTTGA 600 

ACTATGTATG GCCCAATGTR TTTGAGACAT CTCCTCATGT AATTCAGGCA GTTATGGGAG 660 

CCCTAGAGGG CCTGAGAGTT GCTATTGGAC CATGTAGAAT GTTGCAATAT TGTTTACAGG 720 

50 

GTCTGTTTCA CCCAGCCCGG AAAGTCAGAG ATGTATATTG GAAAATTTAC AACTCCATCT 780 

ACATTGGTTC CCAGGACGCT CTCATAGCAC ATTACCCAAG AATCTACCAA CGATGATAAG 840 

55 RACACCTATA TTCGTTATGA ACTTGACTAT ATCTTATAAT TTTATTGTTW ATTTKGTGKT 900 

TAATGCACAS TACTTCACAC CTTAAACTTG CTTTGATTTG GTGATGTAAA CTTTTAAACA 960 

TTGCAGATCA GTGTAGGACT GGTCCATAGG GGAAGAGCTA GGAANTCCAT AGGC 1014 

(2) INFORMATION FOR SEQ ID NO: 191:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2779 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TCGCAGCAGG GTGTGTCCAG ATGGTCAGTC TCTGGTGGCT AGCCTGTCCT GACAGGGGAG 60 

AGTTAAGCTC CCGYTCTCCA CCGTGCCGGC TGGCCAGGTG GGCTGAGGGT GACCGAGAGA 120 

CCAGAACCTG CTTGCTGGAG CTTAGTGCTC AGAGCTGGGG AGGGAGGTTC CGCCGCTCCT 180 

CTGCTGTCAG CGCCGGCAGC CCCTCCCGGC TTCACTTCCT CCCGCAGCCC CTGCTACTGA 240 

GAAGCTCCGG GATCCCAGCA GCCGCCACGC CCTGGCCTCA GCCTGCGGGG CTCCAGTCAG 300 

GCCAACACCG ACGCGCANTG GGAGGAAGAC AGGACCCTTG ACATCTCCAT CTGCACAGAG 360 

GTCCTGGCTG GACCGAGCAG CCTCCTCCTC CTAGGATGAC CTCACCCTCC AGCTCTCCAG 420 

TTTTCAGGTT GGAGACATTA GATQGAGGCC AAGAAGATGG CTCTGAGGCG GACAGAGGAA 480 

AGCTGGATTT TGGGAGCGGG CTGCCTCCCA TGGAGTCACA GTTCCAGGGC GAGGACCGGA 540 

AATTCGCCCC TCAGATAAGA GTCAACCTCA ACTACCGA7VA GGGAACAGGT GCCAGTCAGC 600 

CGGATCCAAA CCGATTTGAC CGAGATCGGC TCTTCAATGC GGTCTCCCGG GGTGTCCCCG 660 

AGGATCTX3GG-TGGAGTTGGA-GAGTACGTGA-GGAAGA€GAG-CAAiGTAGCTG-^ 720- 

AATACACAGA GGGCTCCACA GGTTAAGACGT GCCTGATGAA GGCTGTGCTG AACCTTAAGG 780 

ACGGGGTCAA TGCCTGCATT CTGCCACTGC TGCAGATCGA CCQGGACTCT GGCAATCCTC 840 

AGCCCCTGGT AAATGCCCAG TGCACAGATG ACTATTACCG AGGCCACAGC GCTCTGCACA 900 

TCGCCATTGA GAAGAGGAGW CTGCAGTGTG TGAAGCTCCT GGTGGAGAAT GGGGCCAATG 960 

TGCATGCCCG GGTCTGCGGC GCTTCTTCCA GAAGGGCCAA GGGACTTGCT TTTATTTCGG 1020 

TGAGCTACCC CTCTYTTTGG CCGCTTGCAC CAAGCAGTGG GATGTGGTAA GCTACCTCCT 1080 

GGAGAACCCA CACCAGCCCG CCAGCCTGCA GGCACTGACT CCCAGGGCAA CACAGTCCTG 1140 

CATGCCCTAG TGATGATCTC GGACAACTCA GCTGAGAACA TTGCACTGGT GACCAGCATG 1200 

TATGATGGGC TCCTCCAAGC TGGGGCCCGC CTCTGCCCTA CCGTGCAGCT TGAGGACATC 1260 

CGCAACCTGC AGGATCTCAC GCCTCTGAAG CTGGCCGCCA AGGAGGGCAA GATCGAGATT 1320 

TTCAGGCACA TCCTGCAGCG GGAGTTTTCA GGACTGAGCC ACCTTTCCCG AAAGTTCACC 1380 

GAGTGGTGCT ATGGGCCTGT CCGGGTGTCG CTGTATGACC TGGCTTCTGT GGACAGCTGT 1440 
GAGGAGAACT CAGTGCTGGA GATCATTGCC TTTCATTGCA AGAGCCCGCA CCGACACCGA 1500

ATGGTCGTTT TGGAGCCCCT GAACAAACTG CTGCAGGCGA AATGGGATCT GCTCATCCCC 1560 

AAGTTCTTCT TAAACTTCCT GTGTAATCTG ATCTACATGT TCATCTTCAC CGCTGTTGCC 1620 

TACCATCAGC CTACCCTGAA GAAGCAGGCC GCCCCTCACC TGAAAGCGGA GGTTGGAAAC 1680 

10 TCCATGCTGC TGACX3GGCCA CATCCTTATC CTGCTAGGGG GGATCTACCT CCTCGTGGGC 1740 

CAGCTGTGGT ACTTCTGGCG GCGCCACGTG TTCATCTGGA TCTCGTICAT AGACAGCTAC 1800 

TTTGAAATCC TCTTCCTGTT CCARGCCCTG CTCACAGTGG TC3TCCCARGT GCTGTGTTTC 1860 

15 

CTGGSCATCG AGTGGTACCT GCCCCTGCTT GTGTCTGCGC TGGTGCTGGG CTGGCTGAAC 1920 

CTGCTTTACT ATACACX5TGG CTTCCAGCAC ACAGGCATCT ACAGTGTCAT GATCCAGAAG 1980 

20 CCCTGGTGAG CCTGAGCCAG GANNnTGGCG CCCCGAAGCT CCTACAGGCC CCAATGCCAC 2040 

AGAGTCAGTG CAGCCCATGG AGGGACAGGA KGACGAKGGC AACGGGGCCC AGTACAGGGG 2100 

TATCCTGGAA GCCTCCTTGG AGCTCTTCAA ATTCACCATC GGCATGGGCX; AGCTGGCCTT 2160 

25 

CCAGGARCAG CTGCACTTCC GCGGCATGGT GCTGCTGCTG CTGCTGGSCT ACGTGCTGCT 2220 

CACCTACATC CTGCTGCTCA ACATGCTCAT CXXTCCTCATG AGCGAGACCG TCAACAGTGT 2280 

30 CGCCACTGAC AGCTGGAGCA TCTGGAAGCT GCAGAAAGCC ATCTCTGTCC TGGAGATGGA 2340 

GAATGGCTAT TQGTGGTGCA GGAAGAAGCA GCGGGCAGGT GTGATGCTGA CCGTTGGCAC 2400 

TAAGCCAGAT GGCAGCCCSG ATGAGCGCTG GTGCTTCAGG GTGGAGGAGG TGAACTGGGC 2460 

35 

TTCATGGGAG CAGACGCTGC CTACGCTGTG TGAGGACCCG TCAGGGGCAG GTGTCCrTCG 2520 

AACTCTCGAG AACCCTGTCC TGGCTTCCCC TCCCAAQGAG GATGAGGATG GTGCCTCTGA 2580 

40 GGAAAACTAT GTGCCCGTCC AGCTCCTCCA GTCCAACTGA TGGCCCAGAT GCAGCAGGAG 2640 

GCCAGAGGAC AGAGCAGAGG ATCTTTCCAA CCACATCTGC TGGCTCTGGG GTCCCAGTGA 2700 

ATTCTGGTGG CAAATATATA TTTTCACTAA CTCAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

AAAAAAAAAA AAAAAAOGC 2779 




(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

ACCCGCTCCG CTCCGCTCCG CTCGGCCCCG CGCCGCCCGT CAACATGATC CXXTTGCGGCC 60

TGGCCTGCGA GCGCTGCCGC TGGATCCTGC CCCTGCTCCT ACTCAGCGCC ATCGCCTTCG 120

ACATCATCGC GCTGGCCGGC CGCX5GCTGGT TGCAGTCTAG CGACCACGGC CAGACGTCCT 180

CGCTGTGGTG GAAATGCTCC CAAGAGGGCG GCGGCAGCX3G GTCCTACGAG GAGGGCTGTC 240

AGAGCCTCAT GGAGTACGCG TGGGGTAGAG CAGCGGCTGC CATGCTCTTC TGTGGCTTCA 300

TCATCCTGGT GATCTGTTTC ATCCTCTCCT TCTTCGCCCT CTGTGGACCC CAGATGCTTG 360

TCTTCCTGAG AGTCATTGGA GGTCTCCTTG CCTTGGCTGC TGTGTTCCAG ATCATCTCCC 420

TGGTAATTTA CCCCGTGAAG TACACCCAGA (XTTCACCCT TCATGCCAAC CSTGCTGTCA 480

CTTACATCTA TAACTGGGCC TACGGCTTTG GGTGGGCAGC CACGATTATC CTGATYGGCT 540

GTGCCTTCTT CTTCTGCTGC CTCCCCAACT ACGAAGATGA CCTTCTGGGC AATGCCAAGC 600

CCAGGTACTT CTACACATCT GCCTAACTTG GGAATGAATG TCGGAGAAAA TCGCTGCTGC 660

TGAGATGGAC TCCAGAAGAA GAAACTGTTT CTCCAGGCGA CTTTGAACCC ATTTTTTGGC 720

AGTGTTCATA TTATTAAACT AGTCAAAAAT GCTAAAATAA TTTGGGAGAA AATATTTTTT 780

AAGTAGTGTT ATAGTTTCAT GTrTATCTTT TATTATGTTT TGTGAAGTTG TGTCTTTTCA 840

CTAATTACCT ATACTATGCC AATATTTCCT TATATCTATC CATAACATTT ATACTACATT 900

TGTAAGAGAA TATGCACGTG AAACTTAACA CTTTATAAGG TAAAAATGAG GTTTCCAAGA 960

TTTAATAATC TGATCAAGTT CTTGTTATTT CCAAATAGAA TGGACTCGGT CTGTTAAGGG 1020

CTAAGGAGAA GAGGAAGATA AGGTTAAAAG TTGTTAATGA CCAAACATTC TAAAAGAAAT 1080

GCAAAAAAAA AGTTOATTTT CAAGCCTTCG AACTATTTAA GGAAAGCAAA ATCATriCCT 1140

AAATGCATAT CATTTGTGAG AATTTCTCAT TAATATCCTG AATCATTCAT TTCAGCTAAG 1200

GCTTCATGTT GACTCGATAT GTCATCTAGG AAAGTACTAT TTCATGGTCC AAACCTGTTG 1260

CCATAGTTGG TAAGGCTTTC CTTTAAGTGT GAAATATTTA GATGAAATTT TCTCTTTTAA 1320

AGTrCTTTAT AGGGTTAGGG TGTGGGAAAA TGCTATATTA ATAAATCTGT AGTGTTTTGT 1380

GTTTATATGT TCAGAACCAG AGTAGACTGG ATTGAAAGAT GGACTGGGTC TAATTTATCA 1440

TGACTGATAG ATCTGGTTAA GTTGTGTAGT AAAGCATTAG GAGGGTCATT CTTGTCACAA 1500

AAGTGCCACT AAAACAGCCT CAGGAGAATA AATGACTTGC TTTTCTAAAT CTCAGGTrTA 1560

TCTGGGCTCT ATCATATAGA CAGGCTTCTG ATAGTTTGCA ACTGTAAGCA GAAACCTACA 1620

TATAGTTAAA ATCCTGGTCT TTCTTGGTAA ACAGATTTTA AATGTCTGAT ATAAAACATG 1680

CCACAGGAGA ATTCGGQGAT TTGAGTTTCT CTGAATAGCA TATATATGAT GCATCGGATA 1740

GCTCATTATG ATTTTTTACC ATTTCGACTT ACATAATGAA AACCAATTCA TTTTAAATAT 1800

CAGATTATTA TTTTGTAAGT TGTGGAAAAA GCTAATTGTA GTTTTCATTA TCAAGTTTTC 1860

CCAATAAACC AGGTATTCTA AAAAAAAAAA AAAAAAACTN GAGGGGGGGC CCGGTACCCA 1920

ATT 1923

(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2346 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193:

AGGCTCAGGG GGACACTCTC AAAATTACAC AGCTTTTAAC AGGTGGCAGA ATTGGGGTTC 60

AGACCCAGAT CTGGGTTCAA GTCACTCATG GTGTGATTGC GGCATTCCTT CCCGCATCTG 120 

GGCCTTGCCA TCTCTCTCTC CGAGTGGACA TGGAGAGGAC GGGGGCCCAG CAGCTGGATG 180 

25 

GCTGCAGGGG ATCAAGTCTT CTCTGGGGCT GGGCACGTAN AAGAGCATGT GGCTGGTGGA 240 

CGGCATGCCT GGCTCCTCAC CTGGCAGTCT GCCTGCCCTG CTAACCGGCT GTCTCTTGTT 300 

30 CCCCTAGTGC CCTCGGCTAG CATGACCCGC CTGATGCGWT SCCGCACAGC CTCTGGITCC 360 

AGCGTCATTC TCTGGATGGC ACCCGCAGCC GCTCCCACAC CAGCGAGGGC ACCCGAAGCC 420 

GCTCCCACAC CAGCGAGGGC ACCCGCAGCC GCTCGCACAC CAGCGAGGGG GCCCACCTGG 480 

35 

ACATCACCCC CAACTCGGGT GCTGCTGGGA ACAGNGCCGG GCCCAAGTCC ATGGAGGTCT 540 

CCTGCTAGGC GGCCTGCCCA GCTGCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC 600 

40 CTCCCCGGCC CCTTTTCGCC CCCTGCCTGC CATACTGCGC CTAACTOGGT ATTAATCCAA 660 

AGCTTATTTT GTAAGAGTGA GCTCTGGTGG AGACAAATGA GGTCTATTAC GTGGGTGCCC 720 

TCTCCAAAGG CGGGGTGGCG GTGGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC 780 

45 

CTCTTCCATT AACCAGTGGC CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT 840 

GCCAAAAACA AGACGCGTAC AGCACACACT TCACAAAGCC AAGCCTAGGC CGCCCTGAGC 900 

50 ATCCTGGTTC AAACGGGTGC CTGGTCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT 960 

AACTGAGGAG AAGCTGATCC AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG 1020 

GGTAATAGTG ACATGCAGGC ACCTCTTTTA AACAGGCAAA ACAGGAAGGG GGAAAAGGTG 1080 

55 

GGATTCATGT CGAGGCTAGA GGCATTTGGA ACAACAAATC TACGTAGTTA ACTTGAAGAA 1140 

ACCGATTTTT AAAGTTGGTG CATCTAGAAA GCTTTGAATG CAGAAGCAAA CAAGCTTGAT 1200 

60 TTTTCTAGCA TCCTCTTAAT GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG 1260 



ACAAAAATAT TTCAGCAAAC GTTGGGCATC ATGGTTTTTG AAGGCTTTAG TTCTGCTTTC 1320 

TGCCTCTCCT CCACAGCCCC AACCTCCCAC CCCTGATACA TGAGCCAGTC ATTATTCTTG 1380 

TTCAGGGAGA AGATCATTTA GATTTGTTTT GCATTCCTTA GAATGGAGGG CAACATTCCA 1440 

CAGCTGCCCT GGCTGTGATG AGTGTCCTTG CAGGGGCCGG AGTAGGAGCA CTGGGGTGGG 1500 

GGCGGAATTG GGGTTACTCG ATCTAAGGGA TTCCTTGTTG TTGTGTTGAG ATCCAGTGCA 1560 

GTTGTGATTT CTGTGGATCC CAGCTTGGTT CCAGGAATTT TGTGTGATTG GCTTAAATCC 1620 

AGTTTTCAAT CTTCGACAGC TGGGCTGGAA OGTGAACTCA GTAGCTGAAC CTGTCTGACC 1680 

CGGTCACGTT CTTGGATCCT CAGAACTCTT TGCTCTTGTC GGGGTGGGGG TGGGAACTCA 1740 

CGTGGGGAGC GGTGGCTGAG AAAATGTAAG GATTCTQGAA TACATATTCC ATGGGACTTT 1800 

CCTTCCCTCT CCTGCTTCCT CTTTTCCTGC TCCCTAACCT TTCGCCGAAT GGGGCAGCAC 1860 

CACTGACX3TT TCTGGGCGGC CAC?rGCGGCT GCCAGGTTCC TGTACTACTG CXrTTGTACTT 1920 

TTCATTTTGG CTCACCGTGG ATTTTCTCAT AGGAAGTTTG GTCAGAGTGA ATTGAATATT 1980 

GTAAGTCAGC CACTGGGACC CGAGGATTTC TGGGACCCCG CAGTTGGGAG GAGGAAGTAG 2040 

TCCAGCCTTC CAGGTGGCGT GAGAGGCAAT GACTCGTTAC CTGCCGCCCA TCACCTTGGA 2100 

GGCCTTCCCT GGCCTTGAGT AGAAAAGTCG GGGATCGGGG CAAGAGAGGC TGAGTACGGA 2160 

TGGGAAACTA TTGTGCACAA GTCTTTCCAG AGGAGTTTCT TAATGAGATA TTTGTATTTA 2220 

TTTCCAGACC AATAAATTTG TAACTTTGCA TVAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAACT CGAGGGGGGC CCGTACCXrAA TTCGCCX3TAT ATGATCGTAA 2340 

ACAATC 2346 
(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3054 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
TATCTGAACC ACCCTTTATT CTACATATGA TAGGCAGCAC TGAAATATCC TAACCCCCTA 60
AGCTCMAGGT GCCCTGTGGN ACGAGCAACT GGACTATAGC AGGGCTGGGC TCTGTCTTCC 120
TGGTCATAGG CTCACTCTTT CCCCCAAATC TTCCTCTGGA GCTTTGCAGC CAAGGTGCTA 180
AAAGGAATAG GTAGGAGACC TCTTCTATCT AATCCTTAAA AGCATAATGT TGAACATTCA 240
TTCAACAGCT GATGCCCTAT AACCCCTGCC TGGATTTCTT CCTATTAGGC TATAAGAAGT 300
AGCAAGATCT TTACATAATT CAGAGTGGTT TCATTGCCTT CCTACCCTCT CTAATGGCCC 360
CTCCATTTAT TTGACTAAAG CATCACACAG TGGCACTAGC ATTATACCAA GAGTATGAGA 420
AATACAGTGC TTTATGGCTC TAACATTACT GCCTTCAGTA TCAAGGCTGC CTGGAGAAAG 480
GATGGCAGCC TCAGGGCTTC CTTATGTCCT cx:accacaag AGCTCCTTGA TGAAGGTCAT 540
CTTTTTCCCC TATCCTGTTC TTCCCCTCCC CGCTCCTAAT GGTACXSTGGG TACCCAGGCT 600
GGTTCTTGGG CTAGGTAGTG GGGACCAAGT TCATTACXrrC CCTATCAGTT CTAGCATAGT 660
AAACTACGGT ACCAGTGTTA GTGGGAAGAG CTGGGTTTTC CTAGTATACC CACTGCATCC 720
TACTCCTACC TGGTCAACCC GCTGCTTCCA GGTATGGGAC CTGCTAAGTG TGGAATTACC 780
TGATAAGGGA GAGGGAAATA CAAGGAGGGC CTCTGGTGTT CCTGGCCTCA GCCAGCTGCC 840
CACAAGCCAT AAACCAATAA AACAAGAATA CTGAGTCAGT TTTTTATCTG GGTTCTCTTC 900
ATTCCCACTG CACTTGGTGC TGCTTTGGCT GACTGGGAAC ACCCCATAAC TACAGAGTCT 960
GACAGGAAGA CTGGAGACTG TCCACTTCTA GCTCGGAACT TACTGTGTAA ATAAACTTTC 1020
AGAACTGCTA CCATGAAGTG AAAATGCCAC ATTTTGCTTT ATAATTTCTA CCCATGTTGG 1080
GAAAAACTGG CTTTTTCCCA GCCCTTTCCA GGGCATAAAA CTCAACCCCT TCGATAGCAA 1140
GTCCCATCAG CCTATTAOTr TOTTAAAGAA AACTTGCACT TGTTTTTCTT TTTACAGTTA 1200
CTTCCTTCCT GCCCCAAAAT TATAAACTCT AAGTGTAAAA AAAAGTCTTA ACAACAGdT 1260
CTTGCTTGTA AAAATATGTA TTATACATCT GTATTTTTAA ATTCTGCTCC TGAAAAATGA 1320
CTGTGCCATT CTCCACTGAC TGCATTTGGG GCCTTTCeCA TIXKHCTGCA TCTCTTTTAT 1380
CATTGCAGGC CAGTGGACAG AGGGAGAAGG GAGAACAGGG GTCGCCAACA CTTGTGTTGC 1440
TTTCTGACTG ATCCTGAACA AGAAAGAGTA ACACTGAGGC GCTOGCTCCC ATGCACAACT 1500
CTCCAAAACA CTTATCCTCC TGCAAGAGTG GGCTTTCTAG GGTCTTTACT GGGAAGCAGT 1560
TAAGCCCCXrr CCTCACCCCT TCCTTTTTTC TTTCTTTACT CCTTTGGCIT CAAAGGATTT 1620
TGGAAAAGAA ACAATATGCT TTACACTCAT TTTCAATTTC TAAATTTGCA GGGGATACTG 1680
AAAAATACGG CAGGTGGCCT AAGGCTGCTG TAAAGTTGAG GGGAGAGGAA ATCTTAAGAT 1740
TACAAGATAA AAAACGAATC CXCTAAACAA AAAGAACAAT AGAACTGGTC TTCCATTTTG 1800
CCACCTTTCC TGTTCATGAC AGCTACTAAC CTGGAGACAG TAACATTTCA TTAACCAAAG 1860
AAAGTGGGTC ACCTGACCTC TGAAGAGCTG AGTACTCAGG CCACTCCAAT CACCCTACAA 1920
GATGCCAAGG AGGTCCCAGG AAGTCCAGCT CCTTAAACTG ACGCTAGNCA ATAAACCTGG 1980
GCAAGTGAGG CAAGAGAAAT GAGGAAGAAT CXATCTGTGA GGTGACAGGC AAGGATGAAA 2040
GACAAAGAAG GAAAAGAGTA TCAAAGGCAG AAAGGAGATC ATTTAGTTGG GTCTGAAAGG 2100

AAAAGTCTTT GCTATCCX5AC ATGTACTGCT AGTACCTGTA AGCATTTTAG GTCCCAGAAT 2160 

GGAAAAAAAA ATCAGCTATT GGTAATATAA TAATGTCCTT TCCCTGGAGT CAGTTTTTTT 2220 

AAAAAGTTAA CTCTTAGTTT TTACTTGnTT AATTCTAAAA GAGAAGGGAG CTGAGGCCAT 2280 

TCCCTGTAGG AGTAAAGATA AAAGGATAGG AAAAGATTCA AAGCTCTAAT AGAGTCACAG 2340 

CTTTCCCAGG TATAAAACCT AAAATTAAGA AGTACAATAA GCAGAGGTGG AAAATGATCT 2400 

AGTTCCTGAT AGCTACCCAC AGAGCAAGTG ATTTATAAAT TTGAAATCCA AACTACTTTC 2460 

TTAATATCAC TTTGGTCTCC ATTTTTCCCA GGACAGGAAA TATGTCCCCC CCTAACTTTC 2520 

TTGCTTCAAA AATTAAAATC CAGCATCCCA AGATCATTCT ACAAGTAATT TTGCACAGAC 2580 

ATCTCCTCAC CCCAGTGCCT GTCTGGAGCT CACCCAAGGT CACCAAACAA CTTGGTTGTG 2640 

AACCNAACTG CCTTAACCTT CTGGGGGAGG QGGATTAGCT AGACTAGGAG ACCAGAAGTG 2700 

AATGGGAAAG GGTGAGGACT TCACAATGTT GGCCTGTCAG AGCTTGATTA GAAGCCAAGA 2760 

CAGTGGCAGC AAAGGAAGAC TTGGCCCAGG AAAAACCTGT GGGTTGTGCT AATTTCTGTC 2820 

CAGAAAATAG GGTGGACAGA AGCTTGTGGG GTGCATGGAG GAATTGGGAC CTGGTTATGT 2880 

TGTTATTCTC GGACTGTGAA TTTTGGTGAT GTAAAACAGA ATATTCTGTA AACCTAATGT 2940 

CTGTATAAAT AATGAGCX3TT AACACAGTAA AATATTCAAT AAGAAGTCAA AAAAAAAAAA 3000 

AAAAAACTCG AGGGGGGGCC CGGTACCCAA TTTNCCAAAT AGAGATNGTA TTAC 3054 
(2) INFORMATION FOR SEQ ID NO: 195:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

GGCAGAGCTC GTGGCCGNAA CTTTTTCTGC TCCTGGCTGC CACCTACTGG CTGGCCGCGG 60
CCCTGGCCTG GGCCTGCACC AGCCTGCOTC CGGGCTCCCA CAGCAGCCCC CTTCCAAGCA 120
GOGTCCCCAC ACCGCGCACC TTCTGCGGGA ACGTGCTCGC CGTGCCGGGG ACCATATGGA 180
CGGAAGGCTT TGTGCTCACC TACAAGCTGG GTGAGCAGGG TGCCAGCAGC CTGTTGATCC 240
TCTTGGCTCC TGCTGGAGCA CGAGCGGCGT TTCTGCTCCC GAGTTGGGAC TGTGGAATGG 300
TGTGGGTGCT GTGGTCTGCT CCATCGCTGG CTCCTCCCTG GGTGGGACCT TGCTGGCCAA 360
GCACTQGAAA CTGCTGCCTC TGTGAGGTCG GTGCTGCGCT TCCGCCTCGG GGGCCTAGCC 420
TGTCAGACTG CCTTGGTCTT CCACCTTGGA CACCCTGGGG GCCAGCATGG ACGCTGGCAC 480
AATdTGAGA GGGTCAGCCT TGCTGAGCCT ATGTCTGCAG CACTTCTTGG GARGCCTGGT 540
CACCACAGTC ACCTTCACTG GGAATGATGC GCTGCAGCCA GCTGGCCCCC AGGGCCTTGC 600
AGGCCACACA CTACAGCCTT CTGGCCACGC TGGAGCTCCT GGGGAAGCTG CTGCTGGGCA 660
CTVTGGSCGG AGGGCCTGGC TGATGGGTTG GGGCCACATC CXrrarTTCTT GCTCCTGCTC 720
ATCCTCTCTG CCTTTCCCX3T TCTGTACCTG GACCTAGCAC CXIAGCACCTT TCTCTGAGCT 780
GAGTGGCTGG AGTGGTCAAT AAAGCCACAT GTGCCTGTGG CXTCAAAAAAA AAAAAAAAAA 840
AAAAAAAAAA AAAAAAACTG GAGGGGGGGC CCGGTACCCA AATCGCCGGA TATGATCGTA 900
AACAATC 907

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 
GGCACGAGGA GGGACAGGGA GTGGGCAAGG GGAAGAAGCA GCTTATTTGA CTJ\ACCAGCC 60
CCTCTGTGGT CCACCAGCGT CTTGGCTTGG TGGGAGGGCT CTCAATCAGC AGGGCCCCAG 120
CAGGGCATGA AGAAGTGGGG CAAAGCCTGG GGCTCGGCGG CGGTGGGGGC AGGTTTGGMA 180
TCTGGAGCCA CGCCTCCTCC AGGCCATGCT CCTTGAACTT GGAAATGTCA ACCGGAGCCC 240
TTAACACCAG CCCTCCAGCA TCTAATAGAC TTGAATCTAC TCTAAACGAA TATTTAATCC 300
AACCTCAACT ACATTGTAGC TCAGTCCAAC GACTAACCCT GAAATGGGQG TGTTCCAGCC 360
TTCAGCGAGA TGGCCAAGCG GTCCCCTGGG GGCTGTGGCA GCGGGCTTAT CCTTCTCTGT 420
TGCCAACCTT GCCGTCCGAC CTCCTCCGCC CCCATGCGGT GACCCCGTCC GTGTCTGTGT 480
CTGTCCATAC GTGTGAGTCC AGCTAAAAAG ACAAAACAGA ACCCGTGGGC CCAGCTCGGA 540
AGGTGCGTGG AGAAGGCTCC GACGTCTCCG AAGTGCAGCC CTTGGGATGG CATTCCGTTG 600
TGTGCCrrAT TCCTGGAGAA TCTGTATACG GCTCGCCTAT AAGAAATATA GCCTCTTCAT 660
GCTGTATTAA AAGGACTTTT AAAAGCAAAA AAAAAAAAAA AAAAACTCGA GGGGGGGCCC 720
GGTACCCAAT TCGCCCAATA GTGAGTCGTA TTACAATTCA CTGGGCCGTC STTTTAACAA 780
CGTCGTGAAC TGGGAAAACC CTGGCGTTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC 840
CTTTCGCCAG CTGGCGTTAA TAGCGAAAAA NGCCCGCACC CGAATCGCCX: TTCCCAACAG 900
TTTGCGCAGC CCTGAATGGC GAAATGGCAA ATTGTAAGCG TTTAATATTT TKKTTAAAAT 960
TCCNCGTTWA AWrrrTTGTT TAAATCARCT CAATTTTTTT AACCCAATAA GSCCGAAATC 1020
CGGCAAATCC CCYTTATTAA TTCCAAAAAA ATAAACCSAA AAWCX3GTTTG AATTTTTrKT 1080
TTCCCCAYTT TTGGAAACAA AWTYCCCCCT TTTTAAAAAA GTTCGAACCC CCAMCCYTCC 1140
AAAGGGGAAA AAACSYTTTT YTGGGGGGNA ANGGGGCCCC CNTACTTTNA ACAYCCCCCC 1200
CCAAWCAATT TTTTTGGGGG GTCCCNAAAG GTCCCCCTAA AANCTTTTTT CGGAACCXrNA 1260
AGGGGANCCC CCCATTTAAA ATTTTNGGTN 1290



(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 
GGTGTGCCTG GATGGTCGTG TAGGTGAGTT TTACCAAGGA TTATGGTAAC AAATGAGTGA 60
GACCTCTATG GAGAAAATAT TGAAGNNCAT TAAAGAAGAC CTCATANTAG GAGAGAATGT 120
SCTTTGGAGG ATTTGTATTG AGCTTTTACA GTATTCATTT TTCAACTCAA GGCAATGGCT 180

TTCTACACCA ACTCTi^TCC ATTAAOSGGT.CrTATC CTATQAAGTiPLJiTA<3C^ 240- 

ATGCTTAGTG TGTATTTCTC TCTTTGAGAC ACTGTAATTT CTACCAGAAA TTTCCAGAGC 300
ATTATGTAGG TAGAAAAAAA TGCAAGCAAG CTGTTAAAGA TCTTGGATCC CATTATATAG 360
TATGTATAGC TQAAATCTGT AATTCAATCA Ci'X'l'riir i 'Cr TTTATCCTCT AACCAAAAAA 420
TTGTTTAATT TTGCATCCCA AATGTTTTTA ATCTTTGTAT ATTTTTTAAA AAYCCTTTTC 480
TCCTCATCAT TGCCTTTTTT GTGGTTGTAA ATAGACTTAC TTGCACTTTG AAGATGAGTT 540
ACTCCTTGTC ATCTTACAAA TATGTGATAT GGTAATTTTC ATAACAGATG TCAGTTTTGA 600
ACCAAGAATT GGTGATTTGT TTATAAGAAA AAAACTGGCT TCAITTCTGT GAAATTGCTC 660
TTTGAAAATT TCTTTTTACA CGTGTAAGCC AACTGAGATA CCGTGATGGT GTTGATTTCT 720
TTCAATGATG CTTACCATCT ATTTTAGCCA CTGAGCCTTT TATTATTTGT CTATTTGTAA 780
AGTTTATTTG TCTTAACTCA TTTAATAAAT ATACTGTTTA TCTGTTTCTG AATGGGGACT 840
GAACTTTTTG GATATTGATA TTGATTTGAA AATATTTTGG AATTTTTTCT ACTTGAAATT 900
TTAGAAATCT AATKGAAAAT TCTATAATGT ACTGAAAGTA WGGTTGTGTA CAGTGAKCAC 960
TCTCTAATAA TATGATGNCT TGCCCTAAAN GAGGNGGGAC ATGTCCCACT TTCCACXiACXS 1020



(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 524 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

AATTCCCGAA GCTGAGGGTT GTGTGCCNTC GGGCGAGCCA AGTCTTTTGA CCGGACCCTT 60
CCCGGCGCAG AAGANCTGAA GTTGATTTGA GAGCCTGTKT ITGGGGTrRA GCCGAGCTGC 120
TGCGGGCTTY GTCGCCGGCC AGGACACAAG YTACTTGCAA CGGGGCGGCG CCTGGCTTAT 180
GATGTTCCTC AACCCAGGGG CGGCCTCTGC CCTCTACTCG TGCCAGGCCC ACTTGCCAGG 240
CAGGAGCCCT CCCCAAGCCT TCAGGGCTGC TCGGAGTCAC CTGTTGGAAT GGACTAAAAG 300
GACCCTTGTG TGGGAACAGG TGCTCCAAAC ACCCTGCTGC TGGCTGCCAG GCAGGCCCTC 360
TGGAAGGGAA GGGGCAGGAC TCATCAGGAC CTCCCTGGAC CCTGCAGGGC AGGCAGTTGG 420
CCCGAGCCCA AGCATTTGGC TCTGCTTGCC CCAAGGGGAC AGGAAGCCTC TTGGGCCTCT 480
TCCCTTCCTG GACAAGGCCC CCTGCCTTTG CCTCACATAA ACTG 524



(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:
GTGATACAAG GAAGGGTGAT CATCATCTGT CACCATGCAA TTCCTGCTCA CAGCCTTTCT 60
GTTGGTGCCA CTTCTGGCTC TTTGTGATGT CCCCATATCC CTAGGCTTCT CCCCCTCCTA 120
GAAGGGCTTC TTGATAGATT AGAAAATAAG AATGAGTGAC ATTTCCTATG TGCATATAAG 180
AAGGAGCCAC AAGACATGTC TTTTAAATAA AAGGACAGTG TCCATCCTTT TAGCTGCCGA 240
ATAGAACCTT GGTCTCATCC TCCTGGAGCT AGGSCTTAAA ACAGCTTCTG TGTTTCTSAT 300
TKGTCTCART GTTTTGCCAA GGTTTTATTC GG 332



(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 376 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



CCAGGGAAGC CCCARGCCTG TCCTGAATTG ACATCAGTGC TTCCCTGAAC TGCCTCCCCC 60
ACCCCTGGGC ATTATCCCAG GAAACTTATG TTTTCTAGAA GCTAAGCAGC TGCTGGGACT 120
CAGGGACTGG TGCAGGTAGG CTGAGTGGCA GCTCAGTCCT AGAAGGTCTC TGAAGATCTG 180
GACTGAGGAC CYTGCTACTC CCCAAGCCAG AGCCCATCAG CCAGGCCTGC TGTGAGCCAC 240
CTGCCTCTCG AGTGCTGAGC TCAACCAAAG GCTGGCAAGC TCTGGGCCTC ATTTAAQGGA 300
TTCTGATGAG CCGATGGGCC CTGGAGGCAG CCCATTAAAG CATCTGGCTC GTTTTTGGAA 360
AAAAAAAAAA AAAAAG 376



(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1192 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:



CCCAGTATAT TTCTATAACA TTTATTTTAG TGAACTTATA ATGITTCTTT GTATTAAATT 60
ATTAGATTAT ATCTTTAGAT AATATTGTTA CTNAATTAGT AGGTAATATA TATTTTATTC 120
AAAAATAAAT TGTGCATCTA ATGTCTACCA ATTAATGTAC TTGTAGATGT ATCTTATCTT 180
AACTTGAGTC TTTGCTGCCC CTAATGAGGT GTGAAGGACT CTTCTCCCCT GGQGAAGTTT 240
TTCTTTTTCA GGAGGGAGGA GGGCTTTCCC AGGTAATGTG TCTAGAGTGT TGGGCAGAAR 300
AATCTGGGAC CACACCACAC CAGTTCTCTC CTTAATCCAC GTCATTTGCC TTCTATCCCA 360
GCTATGTTTC CAGTGTCCTC TGGGTGTTTC CAAGAGCAAC AAGAAAYGAA TAAATCTCTG 420
KTGAGTTGTT TATTTGTTCT TCACTTTGTT TTACACTGTA WTTTCTGAGT TTATGGGTGT 480
CTGTGAATTA AAAAGGAAAA GTRGAAATAA GTAAAACTCA GGTTGAAGGA AATATACATA 540
AATAAGATAA AGCTGACCTG TAGATATARR CAGGTTATAA RAGCTTAGAG TTGTCTAAGT 600
TGRGTGCAAA KTTTCCTCTG ATCTTTCTGA TGCCGARACA AAAAAGGCAG TCATGTTTGT 660
WATGTGATTG GAATGGAACC CGARAAGAGA GCAYGCTGTG TTCITGGGGA CAGGAAAGCT 720
TGYGTGCACC AAGTCTKAAC CACCACCTTC ATGGGACATA GRTTATGTGC TGGAACATAT 780
TTCACACCGG CCTGGCAGTA AACACTTGTA GTGTTGTGCA GTGGAAACGG TCATCTTCCG 840
CTAAAGCACG GCGTGTTGTG CAG(rGGAAAT GGTCATCTGC TCCTAAAACA CAGCTTCCAT 900
CGTAATGTAT GCTCCTTACT CAAAGAGTGT GGTCCCAAAC AGCCTTTGGG AGGTCCTCCT 960
TGATTCATGG ATGAAACCTG GAACATCTTG AGGACTGAGT TAACCATAGG TCCTTAAATA 1020
ACTCTCCACA CGTTTTTCTT AGTITATCTC TACATGCAGG GTGTGCAGCA GCCTGTTCAA 1080
AGTCATATTT TCTGGGAAAT ATTTCCAGTG TTTATTTGCA CTTTAGCCCA CTCTGTGTAG 1140
CCTTATTTCT TCTAAACTCA CCATTAATCT GAATAATAGT CAAATTTAGG GG 1192



(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:
ATCTTGGGCT ATCTTTGACA GGGGATTCTT GCAAGTTGAT GCTTTCTACA AGTGAATATA 60
GTCAGTCCCC AAAGATGGAG AGCTTGAGTT CTCACAGAAT TGATGAAGAT GGAGAAAACA 120
CACAGATTGA GQATACGGAA CCCATGTCTC CAGTTCTCAA TTCTAAATrr GTTCCTGCTG 180
AAAATGATAG TATCCTGATG AATCCAGCAC AGGATGGTGA AGTACAACTG AGTCAGAATG 240
ATQACAAAAC AAAGGGAGAT GATACAGACA CCMGGGATGA CATTAGTATT TTAGCCACTG 300
GOTGCAAGGG CAGAGAAGAA ACGGTAGCAG AAGATGTTTG TATTGATCTC ACTTGTGATT 360
CGQGGAGTCA GGCAGTTCCG TCACCAGCTA CTCGATCTGA GGCACTTTCT AGTGTGTTAG 420
ATCAGGAGGA AGCTATGGAA ATTAAAGAAC ACCATCCAGA GGAGGGGTCT TCAGGGTCTG 480
AGGTGGAAGA AATCCCTGAG ACACCTTGTG AAAGTCAAGG AGAGGAACTC AAAGAAGAAA 540
ATATGGAGAG TGTTCCGTTG CACCTTTCTC TGACTGAAAC TCAGTCCCA 589



(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 

GGCACGAGCG CAAGCTGCTG GCCQCCATCA ACGCGTTCCG CCAGGTGCGG CTGAAACACC 60
GGAAGCTCCG GGAACAAGTG AACTCCATGG TGGACATCTC CAAGATGCAC ATGATCCTGT 120
ATGACCTGCA GCAGAATCTG AGCAGCTCAC ACCGGGCCCT GGAGAAACAG ATTGACACGC 180
TGGCGGGGAA GCTGGATGCC CTGACTGAGC TGCTTAGCAC TGCCCTGGGG CCGAGCAGCT 240
TCCAGAACCC AGCCAGCAGT CCAAGTAGCT GGACCCACGA GGAQGAACCA GGCTACTTTC 300
CCCAGTACTG AGTGGTGGAC ATCGTCTCTG CCACTCCTGA CCAGCCTGAA CAAAGCACCT 360
CAAGTGCAAG GACCAAAGGG GGCCTGGCTT GGATGGGTTG GCTTGCTGAT GGCTGCTGGA 420
GGGGACGCTG GCTAAAGTGG GGAGGCCTTG GCCCACCTGA GGCCCCAGGT GGGAACATGG 480
TCACCCCCAC TCTGCATACC CTCATCAAAA ACACTCTCAC TATGCTGCTA TGGACGACCT 540
CCAGCTCTCA GTTACAAGTG CAGGCGACTG GAGGCAGGAC TCTTGGGTCC CTQGGAAAGA 600
GGGTACTAGG GGCCCGGATC CAGGATTCTG GGAGGCTTCA GTTACCGCTG GCCGAGCTGA 660
AGAACTGGGT ATGAGGCTGG GGCGGGGCTG GAGGTGGCGC CCCCTGGTGG GACAACAAAG 720
AGGACACCAT TTTTCCAGAG CTGCAGAGAG CACCTGGTGG GGAGGAAGAA GTGTAACTCA 780
CCAGCCTCTG CTCTTATCTT TGTAATAAAT GTTAAAGCCA GAAAAAAAAA AAAAAAAAAA 840
AAAAAAA 847



(2) INFORMATION FOR SEQ ID NO: 204: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 

ACAAACATAC TCGCAGGAAG GAGTCTCATG CTGCCCGCAG CATCAGCGCA ACNCNTGGCC 60
GCCATCAACG CGTTCCGCCA GGTGCGGCTG AAACACCGGA AGCTCCGGGA ACAAGTGAAC 120
TCCATGGTGG ACATCTCCAA GATGCACATG ATCCTGTATG ACCTGCAGCA GAATCTGAGC 180
AGCTCACACC GGGCCCTGGA GAAACAGATT GACACGCTGG CGGGGAAGCT GGATGCCCTG 240
ACTGAGCTGC TTAGCACTGC CCTGGGGCCG AGGCAGCTTC CAGAACCCAG CCAGCAGTCC 300
AAGTAGCTGG ACCCACGNAG GAGGAACCAG GCTACTTTCC CCAGTACTGA GGTGGTGGAC 360
ATNCGTCTCT TGCCACTCCN TGNACCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC 420
AAAGGGGGCC CTGGCTTGGA GTGGGTTGGC TTGCTGATGG CTGCTGGAGG GGACGCTGGC 480
TAAAGTGGGK AGGCCTTGGC CCACXTTGAGG CCCCAGGTGG GAACATGGTC ACCCXICACTC 540
TGCATACCCT CATCAAAAAC ACTCTCACTA TGCTGCTATG GACGACCTCC AGCTCTCAGT 600
TACAAGTGCA GGCGACTGGA GGCAGGACTC CTGGGTCCCT GGGAAAGAGG GTACTAGGGG 660
CCXTGGATCCA GGATTCTGGG AGGCTTCAGT TACCGCTGGC CGAGCTGAAG AACTGGGTAT 720
GAGGCTGGGG CGGGGCYGGA GGTGGCGCCC CCTGGTGGGA CAACAAAGAG GACACCATTT 780
TTCCAGAGCT GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCCTCTGCT 840
CTTATCTTTG TA 852



(2) INFORMATION FOR SEQ ID NO: 205: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1354 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

GATTCGGCAC GAGGCTTGCT GGAGCAGGAG AAGTCTCTRG CCGGCTGGGC ACTGGTGCTG 60
GCASGARCTG GCATTGGACT CATGGTGCTG CATGCAGAGA TGCTGTGGTT CGGGGGGTGC 120
TCGGCTGTCA ATGCCACTGG GCACCTTTCA GACACACTTT GGCTGATCCC CATCACATTC 180
CTGACCATCG GCTATGGTGA CGTQGTGCCG GGCACCATGT GGGGCAAGAT CGTYTGCCTG 240
TGCACTGGAG TCATGGGTGT CTGCTGCACA GCCCTGCTGG TGGCCGTGGT GGCCCGGAAG 300
CTGGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAOTATACC 360
AAAGAGATGA AGGAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 420
ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAANCTGCT GGCCGCCATC 480
AACGCGTTCC GCCAGGTGCG GCTGAAACAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 540
GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 600
CACCGGGCCC TGGAGAAACA GATTGACACG CTGGCGGGGA AGCTGGATGC CCTGACTGAG 660
CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 720
CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTQGTG GACATCGTCT 780
CTGCCACTCC TGANCCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC AAAGGGGGCC 840
CTGGCTTGGA GTGGGTTGGC TTGCTGATGG CTGCTGGAGG GGACGCTGGC TAAAGTGGGK 900
AGGCCTTGGC CCACCTGAGG CCCCAGGTGG GAACATGGTC ACCXXX:ACTC TGCATACCXTT 960
CATCAAAAAC ACTCTCACTA TGCTGCTATG GACGACCTCC AGCTCTCAGT TACAAGTGCA 1020
GGCGACTGGA GGCAGGACTC YTGGGTCCCT GGGAAAGAGG GYACTAGGGG CCCGGATCCA 1080
GGATTCTGGG AGGCTTCAGT TACCXXTTGGC CGAGCTGAAG AACTGGGTAT GAGGCTGGGG 1140
CGGGGCTGGA GGTGGCGCCC CCTGGTGGGA CAACAAAGAG GACACCATTT TTCCAGAGCT 1200
GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCCTCTGCT CTTATCTTTG 1260
TAATAAATGT TAAAGCCAGA AAAAAATAAA AAAAAAAAAA AAAAAACTTCG AGGGGGGCCX: 1320
AGACCCAATC TCCCTATAGT AAGNCGCCNN ANAN 1354

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TCCCCAGGTG CACAGCCAGG GCCCTCCTGT CTGCAGGAGA ATTCACAGCT GGTGTGGGAC 60
TCAGCCCCTA GNCCATTCAA AGCCTTAATG TTGTAATCAT ATCTTACGTG TTGAAGACCT 120
GACTGGAGAA ACAAAATGTG CAATAACGYG AATTTTATCT TAGAGATCTG TGCAGCCTAT 180
TTCTGTCACA AAAGTTATAT TGTCTAATAA GAGAAGTCTT AATGGCCTCT GTGAATAATG 240
TAACTCCAGT TACACGGTGA CTTTTAATAG CATACAGTGA TTTGATGAAA GGACGTCAAA 300
CAATGTGGCG ATGTCGTGGA AAGTTATCTT TCCCGCTCTT TGCTGTGGTC ATTGTGTCrT 360
GCAGAAAGGA TGGCCCTGAT GCAGCAGCAG CGCCAGCTGT ANATAAAAAA TAATTCACAC 420
TATCAGACTA GCAAGGCACT AGAACTGGAA AAGACCACAG AAAACAAAGA ATCCAACCCT 480
TTCATCTTAC AGGTGAACAA ACTGTGATGA TGCACATGTA TGTGTTTTGT AAGCTGTGAG 540
CACCGTAACA AAATGTAAAT TTGCCATTAT TAGGAAGTGC TGGTGGCAGT GAAGAAGCAC 600
CCAGGCCACT TGACTCCCAG TCTGGTGCCC TGTCTACACC AGACAACACA GGAGCTGGGT 660
CAGATTCCCC TCAGCTGCTT AACAAAGTTC CTCGAACAGA AAGTGCTTAC AAAGCTGCCT 720
TCTCGGATAC TGAAAGGTCG AGTTTTCTGA ACTGCACTGA TTTTATTGCA GTTGAAAAAA 780
AAAAAAAGCT ATTCCAAAGA TTTCAAGCTG TTCTGAGACA TCTTCTGATG GCTTTACTTC 840
CTGAGAGGCA ATGTTTTTAC TTTATGCATA ATTCATTGTT GCCAAGGAAT AAAGTGAAGA 900
AACAGCACCT TTTAATATAT AGGTCTCTCT GGAAGAGACC TAAATTAGAA AGAGAAAACT 960
GTGACAATTT TCATATTCTC ATTCTTAAAA AACACTAATC TTAACTAACA AAAGTTCTTT 1020
TGAGAATAAG TTACACACAA TGGCCACAGC AGTTTGTCTT TAATAGTATA GTGCCTATAC 1080
TCATGTAATC GGTTACTCAC TACTGCCTTT AAAAAAAAAA ACCAGCATAT TTATTGAAAA 1140
CATGAGACAG GATTATAGTG CCTTAACCGA TATATTTTGT GAdTAAAAA ATACATTTAA 1200
AACTGCTCTT CTGCTCTAGT ACCATGCTTA GTGCAAATGA TTATTTCTAT GTACAACTGA 1260
TGCTTGTTCT TATTTTAATA AATTTATCAG AGTGAAAAAA AAAAAAAAAA AAAAAAAAAA 1320
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGAA NAAAAMAA 1378



(2) INFORMATION FOR SEQ ID NO: 207:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

AANCCACTGC AOTTTAAACC CCCTCCCCTC CAAGAAAGTT CACAACCGGC CATGGATGAC 60
CCTCATTTTA GATGGGCCNC AATATTTAAG ATGGACTGRG GMCCCCAJRAG ACTGACCCTT 120
GAAAGGGGGA CTCAGAAGAA AGATCCTTGA CATTGCCMAA CATGCTGGGC TTGTCCAACA 180
CAGTGATGCG GCTCATCGAG AARCGGGCTT TCCMAGGACA AGTACTTTAT GATAGGTGGG 240
ATGCTGCTGA CCTGTGTGGT CATGTTCCTC GTGGTGCAGT ACCTGACATG AGCCAGCCAC 300
GCTCAGTGGC TGAACAGCAT TCCCACAGCC TGCAAGTGTG TGTGTGTGTG AAAGAGAGAG 360
GGGGCCCAGA GGCCGCCTTT TGAAATGTTT GCCTGTCTGA ACTGTGAAGA CACTTGQGAG 420
TGATTGTGGT CTAATTTCCA ACCTGCTCTG TTTTCTGTGA CATCTTGGAG GGGGAGCTAG 480
TGCCAMCACC ATGCGCGGTG CTTAGGAAAT GAAAGAAGTC CCGOGTCTGT CTCTCTCACT 540
CTCGCTCTCA MTGGGGGAGG GAAAGAATGG CTTTGGTGGC TTTGTTCACA CAGCTGATGC 600
GTC3SCCTGGG AAGGTGTCCA CAGTGAGCCC TGTGTGCAGG ACTGTCCACN ACGGTTCACA 660
CCTTGTCACC ATCAGGCCTT TCTGGCTCCT GATAGGGTGG AGCAAAAGTG GAAAGGAAAG 720
GAAAGAGGCY TTTTCTCACA GCCATTATAT TAAATAGTAG GTCGATTCAC ATCYTCGTGC 780
TCCTGGCCAC CCTCCCCTGT GCCTCAGTGA CATGTAGATG ACTGACTGCC AATACTTGTC 840
ACCATTCCCT GGAAGCAGCT ACCTAGGGGA AACAAGATGT AGTGCTATTG CCGATAACAA 900
GTAAGATTTT CCACACTACA GCTGGGTGTT TCTCTTTTCT AAAGTGAGGC CAGTGTTATT 960
TCCCGGGAGT GTTCAGTCTT GACCCTAGTC ACTGATTTTT TCTAGTTGTT AATAGAGTGG 1020
TTGGGCTTTT AAGGTTCAGA GACTGTGGGC TTGGGCACCT GCGCCCAGGG STTTTGTGGG 1080
GGCCTTTGCC CCTTAGRAAA GTAGCTTTTA GGGGCAAAGA TTTGTTGATT TTCCCCATTA 1140
CAGTCTTCAG CTCNAGGGTT TTAAAA 1166

(2) INFORMATION FOR SEQ ID NO: 208:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 697 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:
TACTTCTAGG ATTATAAGGA ATTAACATTG AGATGACATT TCCATTTGAG AAGGAAAATA 60
GTrGCTTTCA GTGCCTTTTA TTTGATTCCT GGAGAGAGCA GACTCGCACS AACATTCAAC 120
CCCAGCGCTG ATATGACAGT AATCCTCAGA GGCAGAGCCC AGCACAAAAC AGCAATGCTA 180
GAAAGTTACA ATTGGAAAGT TTCCTGCCAG CTTCGGGAAT GACACTGCAA AGCTGATGCC 240
AGAAACTGCC AGRGTAATTC TCCTCATTAC TGCTCTACCC ACCCACTTTC AGCTCCCCAA 300
ATTAACTAGT GCAGTTGACT AATTCTCTTT ACCTTTATCA TTTARGGTGA RGCATTGCAC 360
AAAAACTCTC GACTTTGCCA TATAAGGGCT GTGGTTCTCT GTGGTCCCCT GGATAAGAGG 420
CATCACCATT ATCTGGAAAC ATGCAGTAAA TGCAGATTNT TCATCTTCTC CCCAGACCTC 480
CTGAGTTAGA AATTCACAAG TTCTCCAGGT GATCTCATAC ATGCTAAAGT TTGAGAACCA 540
TTGAGTAAAG TTAATGCATT AAGAAGAGAT TAGATAGGGA TGGTGGCGTA TCTTCCTACA 600
GTTTCCCTGT TAACAAGAAA GTCAGAGGTC AGTTGATCAG ACATTAGATT ATTTATTGCT 660
AAAACTAAAA AAAATTATJA AAAACTQGAG GGGGGCC 697



(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



CGTGAGTCAC CTCTCTATAG TGGGCGTGGC CGAGGCCGGG GTGACCCTGC CGAAGCCTCC 60
GCTGCCAGAA ACCATGTTCA AGGTAATTAA AAGGTCCGTC3 GGGCCAGCCA GCCTGAGCTT 120
GCTCACCTTC AAAGTCTATG CAGCACCAAA AAAGGACTCA CCTCCCAAAA ATTCCGTGAA 180
GGTTGATGAG CTTTCACTCT ACTCAGTTCC TGAGGGTCAA TCGAAGTATG TGGAGGAGGC 240
AAGGAGCCAG CTTGAAGAAA GCATCTCACA GCTCCGACAC TATTGCGAGC CATACACAAC 300
CTGGTGTCAG GAAACGTACT CCCAAACTAA GCXTCAAGATG CAAAGTTTGG TTCAATGGGG 360
GTTAGACAGC TATGACTATC TCCAAAATGC ACCTCCTGGA TTTTTTCCGA GACTTGGTGT 420
TATTQGTTTT GCTGGCCTTA TTGGACTCCT TTTGGCTAGA GGTTCAAAAA TAAAGAAGCT 480
AGTGTATCCG CCTGGTTTCA TGGGATTAGC TGCCTCCCTC TATTATCCAC AACAAGCCAT 540
CGTGTTTGCC CAGGTCAGTG GGGAGAGATT ATATGACTGG GGTTTACGAG GATATATAGT 600
CATAGAAGAT TTGTGGAAGG AGAACTTTCA AAAGCCAGGA AATGTGAAGA ATTCACCTGG 660
AACTAAGTAG AAAACTYCAT GYTCTGCCAT CTTAATCAGT TATRGGTAT^ CATTGGAAAC 720
TCCATAGAAT AAATCAGTAT TTCTACAGAA AAATGGCATA GAAGTCAGTA TTGAATGTAT 780
TAAATTGGCT TTCTTCTTCA GGAAAAACTA GACCAGACCT CTGTTATCTT CTGTGAAATC 840
ATCCTACAAG CAAACTAACC TGGAATCCCT TCACCTAGAG ATAATGTACA AGCCTTAGAA 900
CTCCICATTC TCATGTTGCT ATrTATGTAC CT 932



(2) INFORMATION FOR SEQ ID NO: 210:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

GTCATTCTTT AAATAAAAGC TTTCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA 60
AGATTCCCCC TAGGGTTGAT ATGTGTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA 120
TTTTAAAGCT TTGGCTATAT AGTCAGAAAT GTCCTAAATA ACAAACTATT TTGTATTTAA 180
TTTAGGGAAG ACTAAAGGGA AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT 240
ATTAGGGCTT AAAGGGCTTT TCTAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT 300
TCCTGTTTTT GAGATGAACA GATCTCTGGG GAAATTGTTG AGTTACAATG QCATTTCACT 360
GTGATCCCTC TCAAGCTCAG ATCAGTTCTA TAACCCAATG ACAACCTGTC TCTTTGGTTT 420
ACTGTCCTGT GAAATGTCAG CTCAAGTTTC CCAGAAGTCG TGTGTTTATG ATGAGTCAGA 480
GTGCTTTTCC TCGGTGGGAC AGTTGCTGGC CCTCTTAATT TTGGTGTATG TGCTTCCAAG 540
TATCTAAACC TCCAGTCTGA TCTGTATATG CTATCCTAAC TGTTAATTGT ATTATTGATT 600
ATCTTGATTA TCTTGCTTGA AGGTTCATAC TTTTCAATTT GATAGAAATA AAGTTTTTTT 660
C 661



(2) INFORMATION FOR SEQ ID NO: 211: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 592 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

GAAACTGACA TTGTTAAACA CACTAAAACA GAAGTACTTA CCTCTTGAAG ATTTAATATA 60
TAATGGTTGA CATGATACAT GTACATGAAT GGAATGACCA GATGCTTATG GTCTACATTT 120
TCCTTTATCC TGTTAGTATT ACCTTCCTTA ATCTTTGTTC CTTAACATGC TAAATTCCTC 180
TTCAGTGTTT ATTTTCTAGT GACAGAATGC TAACATTTCT TACACCCTGG CAGAAGQGAG 240
AGAAATGTGT TTTGGGGTGG GTAACTAAAT TTTTGAGTGA AATATCATAA GATGAGAATG 300
GAAAGAGQGA GACACAAAGA GTTATAACAA AAAAACAATG GTTmTTAG CCATTTGACT 360
GGCTCTTTAA ATAGTCTACA AGACATTCAC GTTNAACATC ACTTTTAGTG AAATAAAATG 420
TGCCATACTA GTATGTGCTT CAAAAGGGCA AATGTGCTTT AGTGCGGTAA GGCTAAATTT 480
TGGTCATTTG ACATCAGAGA TGTTGTAAGT ATTGCACTTA ATACGCACCT ATTTCTCAAT 540
AGTOn^ATTT TTTTGGCTAG CATTTNCTTT ACCACTAACC TTGTTGGATA GC 592



(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 
TGGAGTGGCT TTCCAGCTGA ATGAATCCTA TGTCTCGCGT GCAOGTGGTT GGTTTTCAAT 60
GTTCTTSCTA ATTTTTTrCC TATTGGCTCT TGGGAGTTTN CnTGTTTGC TCCTGTGTTT 120
GCCCAGCTTT AATAAAACCA GGCGCAAACA AAAACCATAG CATTCTGAAA CAATAGGGGG 180
CCCACATTGG ACCCAGTATG TCACTTTAAT GGACTTCAAG AAAAAATCTG AATGGGAAAA 240
TGACACTAGG AATGTATACT CCACACATTT TATGCCATAT AATGGTGTGT TTTCTTAATT 300
TTGTTTCTTG TQGCGAAATG TGGCTTTCAA ATTAAAATQ4 CCTTTTCTTC TTKGAAACTT 360
TTTGTTTKGA CTKGTATAAT TAAGGGTTTG GAAAGATTCA TAATTMTGAG AGAGGTTTGC 420
AACCAGGAGA TACAAAGAAG TCTCAGTAGT AATCTTGTTC ATGTGCTTTT ACAGCCAGCT 480
ACATTTAAGR ATGTATTAGT TACAGAAATT ATATGTCTGT GTATGTGTCT CTACTCAATA 540
AAGTACATGC CTCCACATAA TGCGGTGCTG TCCATCTCGG CAAATACTGG CCAAGTCCCT 600
TTATGACAGG CACACAGAAA CCATAGCATG GTCTGGCTTT CAGAAAATGC CTCTCATCTT 660
TCCTGGAACC TTATTTTGCT AAATGTCTGT TTTCTTGTGA TTTGTTGTAC CTCACAGCAC 720
CATTGTGACC ATGGTGATGC CTCATTTGCA TGATATGTAC CTTGTGTTTA ATGTGAAATA 780
CATTTTCATT GAAGAGTCTG ATGACTTGCT AGCGTTTTAT TTTTTCTGTA AGGTCAATGT 840
GCTGAAACCA AACCAGGCTT TTAAAAACCT GTGTAGAAGA AAACCAAAAA ATCCTGTGTG 900
GGTGTCCmT CCCTGTCAAA CTCATTAAAA ATTCCTTT 938



(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 



AGCCTGCCGG GAGAGTGGTG GCATCTRARA GGCTGGTCGT GGACTGTGGT TGGGGGAGGT 60
GGGAGCTGTT TTAACCGTGT GCCCCCTCTC CTGTGCCKGC GTGGGCATCC CCCGGGGCAG 120
TGGAACGCGG GCGCTCCTCC AGCTTCCGAG TCCAGCCAGC CTGGGCGCGG GGCGCGCCCC 180
CGAGACACCC GAGGAGTCCG TTCCTCCCTG GTTACGTGGA CTGTGGAGCT GGTCTCTTGT 240
GGCTCAGCGC CGTGCGGAGG TTGAAGCGTA CCTGCGGAGG TCGCACCAGG GGCGTGAGGA 300
GGAGGAGGAA GGGCATGAGC CGAGCTTGAG GAATCCGTGY TCCAAACTCT ACACTCAAOG 360
RTGCMCTGCG CAACTCTGGT GGCGATGGGC TGGGGCAGAT GTCCTTGGAG TTCTACCAGA 420
AGAAGAAGTC TCGCTGGCCA TTCTCAGACG AGTGCATCCC ATGGGAAGTG TGGACGGTCA 480
AGGTGCATGT GGTAGCCCTG GCCACGGAGC AGGAGCGGCA GATCTGCCGG GAGAAGGTGG 540
GTGAGAAACT CTGCGAGAAG ATCATCAACA TCGTGGAGGT GATGAATCGG CATGAGTACT 600
TGCCCAAGAT GCCCACACAG TCGGAGGTGG ATAACGTGTT TGACACAGGC TTGCGGGACG 660
TGCAGCCCTA CCTGTACAAG ATCTCCTTCC AGATCACTGA TGCCCTGGGC ACCTCAGTCA 720
CCACCACXZAT GCGCAGGCTC ATCAAAGACA CCCTTGCCCT CTGAGCGTCG CTGGATCTCT 780
GGGAGCTCCT TGATGGCTCC CAGACCTTGG CTTTTGGGAA TTGCACTTIT GGGCCTTTGG 840
GCTCTGGAAC CTGCTCTGGG TCATTGGTGA GACTTGGAAG GGGCAGCCCC CGCTGGCTTC 900
TTCGTTTTGT GGTTGCCAGC CTCAGGTCAT CCTTTTAATC TTTGCTGACG GTTCAGTCCT 960
GCCTCTACTG TCTCTCCATA GCCCTGGTGG GGTCCCXTTT CTTTCTCCAC TGTACAGAAG 1020
AGCCACCACT GGGATGGGGA ATAAAGTTGA GAACTITGAGT TTGGGCTGAA AAAAAAAAA 1079



(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TGAAGCAGGC GCTCTTGGCT CGGCGCGGCC CGCTGCAATC CGTGGAGGAA CGCGCCGCCG 60
AGCCACCATC ATGCCTGGGC ACTTACAGGA AGGCTTCGGC TGCGTGGTCA CCAACCGATT 120
CGACCAGTTA TTTGACGACG AATCGGACCC CTTCGAGGTG CTGAAGGCAG CAGAGAACAA 180
GAAAAAAGAA GCCGGCGGGG GCGGCGTTGG GGGCCCTGGG GCCAAGAGCG CATCAGGGCC 240
GCGGCCCAGA CCAACTCCAA CGCGGCAGGC AAACAGCTGC GCAAGGAGTC CCAGAAAGAC 300
CGCAAGAACC CGCTGCCCCC CAGCGTTGGC GTGGTTGACA AGAAAGAGGA GACGCAGCCG 360
CCCGTGGCGC TTTAAGAAAG AAGGAATAAG ACGAGTTGGA AGAAGACCTG ATCAACAACT 420
TCAGGGTGAA GGGAAAATAA TTGATAGAAG ACCAGAAAGG CGACCACCTC GTGAACGAAG 480
ATTCGAAAAG CCACTTGAAG AAAAGGGTGA AGGAGGCGAA TTTTCAGTTG ATAGACCGAT 540
TATTGACCGA CCTATTCGAG GTCGTGGTGG TCTTGGAAGA GGTCGAGGQG GCCGTGGACG 600
TGGAATGGGC CGAGGAGATG GATTTGATTC TCGTGGCAAA CGTGAATTTG ATAGGCATAG 660
TGGAAGTGAT AGATCTTCTT TTTCACATTA CAGTGGCCTG AAGCACGAGG ACAAACGTGG 720
AGGTAGCGGA TCTCACAACT GQGGAACTGT CAAAGACGAA TTAACTGACT TGGATCAATC 780
AAATGTGACT GAGGAAACAC CTGAAGGTGA AGAACATCAT CCAGTGGCAG ACACTGAAAA 840
TAAGGAGAAT GAAGTTGAAG AGGTAAAAGA GGAGGGTCCA AAAGAGATGA CTTTGGATGA 900
GTCGAAGGCT ATTCAAAATA AGGACCGGGC AAAAGTAGAA TTTAATATCC GAAAACCAAA 960
TGAAGGTGCT GATGGGCAGT GGAAGAAGGG ATTICTTCTT CATAAATCAA AGAGTGAAGA 1020

GGCTCATGCT GAAGATTCGG TTATGGACCA TCATTTCCGG AAGCXIAGCAA ATGATATAAC 1080 

GTCTCAGCTG GAGATCAATT TTGGAGACCT TGGCCGCCCA GGACGTGGCG GCAGGGGAGG 1140 

ACGAGGTGGA CGTGGGCXSTG GTGGGCX3CCC AAACCGTGGC AGCAGGACCG ACAAGTCAAG 1200 

TGCTTCTGCT CCTGATGTGG ATGACCCAGA GGCATTCCCA GCTCTGGCTT AACTGGATGC 1260 

CATAAGACAA CCCTGGTTCC TTTGTGAACC CTTCTGTTCA AAGCTTTTGC ATGCTTAAGG 1320 

ATTCCAAACG ACTAAGAAAT TAAAAAAAAA AAGACTGTCA TTCATACCAT TCACACCTAA 1380 

AGACTGAATT TTATCTGTTT TAAAAATGAA CTTCTCCCGC TACACAGAAG TAACAAATAT 1440 

GGTAGTCAGT TTTGTATTTA GAAATGTATT GGTAGCAGGG ATGTTTTCAT AATTTTCAGA 1500 

GATTATGCAT TCTTCATGAA TACTTTTGTA TTGCTGCTTG CAAATATGCA TTTCCAAACT 1560 

TGAAATATAG GTGTGAACAG TGTGTACCAG TTTAAAGCTT TCACTTCATT TGTGTTTnT 1620 

AATTAAGGAT TTAGAAGTTC CCXCAATTAC AAACTGGTTT TAAATATTGG ACATACTGGT 1680 

TTTAATACCT GCTTTGCATA TTCACACATG GTCAACTGGG ACATGTTAAA CTTTGATTTG 1740 

TCAAATTTTA TGCTGTGTGG AATACTAACT ATATGTATTT TAACTTAGTT TTAATATTTT 1800 

CATTTTTGGG GAAAAATCTT TnTCACTTC TCATGATAGC TGTTATATAT ATATGCTAAA 1860 

TCTTTATATA CAGAAATATC AGTACTTGAA CAAATTCAAA GCACATTTGG TTTATTAACC 1920 

CTTGCTCCTT GCATGGCTCA TTAGGTTCAA ATTATAACTG ATTTACATTT TCAGCTATAT 1980 

TTACTTTTTA AATGCTTGAG TTTCCCATTT TAAAATCTAA ACTAGACATC 1TAATTGGTG 2040 

AAAGTTGTOT AAAeTAeTTA TTGTTQGTAG GeACATCGTG TCAAGTGAAG TAGTTTTATA 2100

GGTATGGGTT TTTTCTCCCC CTTCACCAGG GTGGGTGGAA TAAGTTGATT TGGCCAATGT 2160 

GTAATATTTA AACTGTTCTG TAAAATAAGT GTCTGGCCAT TTGGTATGAT TTCTGTGTGT 2220 

GAAAGGTCCC AAAATCAAAA TGGTACATCC ATAATCAGCC ACCATTTAAC CCTTCCTTGT 2280 

TCTAAAACAA AAACCAAAGG GCGCTGGTTG GTAGGGTGAG GTGGGGGAGT ATTTTAATTT 2340 

TTGGAATTTG GGAAGCAGAC AGCTTTACTT TGTAAGGTTG GAACAGCAGC ACTATACATG 2400 

AAATATAAAC CAAAAACCTT TACTGTTTCT AAATTTCCTA GATTGCTATT ATTTGGTTGT 2460 

AAGTTGAGTA TTCCACAGAA AGTGGTAATT ATCTCTTCTC TCTTCCTCCA TTAGAAAATT 2520 

AGGTAAATAA TGGATTCCTA TAATGGGAGC ATCACCACTT ATTAAAACAC ACATAGAATG 2580 

ATGAATTAAA AAAGTTTTCT AGGATTGTCT TTTATTCTGC CACATTTATT GATAAACAGT 2640 

GAAGGAATTT TTAAAAAATT TTTAAGAATT GTTTGTCACG TCATTTTTAG AAATGTTCTA 2700 

CCTGTATATG GTAATGTCCA GTTTTAAAAA TATTGGACAT CTTCAATCTT AAACATTTCT 2760 
ATTTAGCTGA TTGGTTCTCA CATATACTTC TAAAAGAAAC TTTTATGTTA TAAGAGTTAC 2820

TTTTTGGATA AGATTTATrA ATCTCAGTTA CCTACTATTC TGACATTTTA GGAAGGAGGT 2880 

AATTGTTTTT AATGATGGAT AAACTTGTGC TGGTGTTTTG GATCTTATGA TGCTGAGCAT 2940

GTTCTGCACT GGTGCTAATG TCTAATATAA TTTTATATTT ACACACATAC GTGCTACCCA 3000 

GAGATTAATT TAGTCCATAT GAACTATTGA CCCATTGrTTC ATTGAGACAG CAACATACGC 3060 

ACTCCTAAAT CAGTGTGTTT AGACTTTTCA AGTATCTAAC TCATTTCCAA ACATGTACCA 3120 

TGTTTTATAA ACCTCTTGAT TTCCAGCAAC ATACTATAGA AAACACXTTGC TACTCAAAAC 3180 

ACAACTTCTC AGTGTCATCC ATTGCTGTCG TGAGAGACAA CATAGCAATA TCTGGTATGT 3240

TGCAAGCTTT CAAGATAGCC TGAACTTAAA AAGTTGGTGC ATTAGTTGTA TCTGATGGAT 3300 

ATAAATTTGC CTCCTAGTTC ACrTTGTGTC AAGAGCTAAA ACTGTGAACC TAACTTTCTC 3360 

TTATTGGTGG GTAATAACTG AAAATAAAGA TTTATTTTCA TGCTCACTTC TTAAAAGTCA 3420 

TAAAAACAAT CAAATAGGRT CATGTTTATT GTCATGTCTT TCCTGGKTTC TGACCTGTCT 3480 

GCACACCCCT GTGTGTTTAT AATTTTTAAA TTGAATTTTA TATGGGGTTT TTATTTGCTA 3540

AAAACCAGGC TGTTGAATCA CATTTGQGAA GGGTACTTAT CTTAATGACT AATGACTTAA 3600 

TTGGGAAAGT TGAATTCTTG TAAAATACAA AATCCAAGGA CTTCTTGGGA TTTAATCTAA 3660 

TTGTCACTTC NTTAGGCAGA TNCACTTTTT TGGATAATGG AAAGTTAAGC ATACCGAATG 3720 

CTACTTTTGG TTGACAAACG GGCCTAATAG TCCGGGQGGA AATCCCTAAC NGGTAAGGNT 3780 

CCCAAGTATG G 3791



(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1334 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 



CAGTGCTCGC TCCTGCTCGG GGCGCTGCGG CCCCGGGCGT CGCCATGACC AGTGAGCTGG 60

ACATCTTCGT GGGGAACAGA CCCTTATCGA CGAGGACGTG TATCGCCTCT GGCTCGATGG 120 

TTACTCGGTG ACCGACGCGG TGGCCCTGCG GGTGCGCTCG GGAATCCTGG AGCAGACTQG 180 

CGCCACGGCA GCGGTGCTNC AGAGCGACAC CATGGACCAT TACCGCACCT TCCACATGCT 240 

CGAGCGGCTG CTGCATGCGC CGCCCAAGCT ACTGCACCAG YTCATCTTCC AGATTCCGCC 300 

CTCCCGGCAG GCACTACTCA TCGAGAGGTA CTATGCCTTT EyVTGAGGCCT TTGTTCGGGA 360

GGTGCTGGGC AAGAAGCTGT CCAAAGGCAC CAAGAAAGAC CTGGATGACA TCAGCACCAA 420

AACAGGCATC ACCCTCAAGA GCTGCCGGAG ACAGTTTGAC AACTTTAAAC GGGTCTTCAA 480

GGTGCTAGAG GAAATGCGQG GCTCCCTGGT GGACAATATT CAGCAACACT TCCTCCTCTC 540

TGACCGGTTG GCCAGGGACT ATGCAGCCAT CGTCTTCTTT GCTAACAACC GCTTTGAGAC 600

AGGGAAGAAA AAACTGCAGT ATCTGAGCTT CGGTGACTTT GCCTTCTGCG CTGAGCTCAT 660

GATCCAAAAC TGGACCCTTG GAGCCGTCGA CTCACAGATG GATGACAJTGG ACATGGACTT 720

AGACAAGGAA TTTCTCCAGG ACTTGAAGGA GCTCAAGGTG CTAGTGGCTG ACAAGGACCT 780

TCTGGACCTG CACAAGAGCC TGGTGTGCAC TGCTCTCCGG GGAAAGCTGG GCGTCTTCTC 840

TGAGATGGAA GCCAACTTCA AGAACCTGTC CCGGGGGCTG GTGAACGTGG CCGCCAAGCT 900

GACCCACAAT AAAGATGTCA GAGACCTGTT TGTGGACCTC GTGGAGAAGT TTGTGGAACC 960

CTGCCGCTCC GACCACTGGC CACTCAGCGA CGTGCGGTTC TTCCTGAATC AGTATTCAGC 1020

GTCTGTCCAC TCCCTCGATG GCTTCCGACA CCAGGCCTCT GGGACCGCTA CATGGGCACC 1080

CTCCGCGGCT GCCTCCTGCG CCTGTATCAT GACTGAGGTG CCTCCCAACG CTCCGCCCAC 1140

GCTGACAATA AAGTTGCTCT GAGTTTGGAG ACTGGTCCTC GCTCCGGGGA GCAAGTQGGG 1200

GGCGTGCAGA TGTGCCTGTG TCTOTCTCTG AGCACCTGGT GTCCGTGTAC AAGGATQGAT 1260

GTGTNCNGTG GCTCCTTGGG AACTGAGACA TATCTCAGGG AATGGTGTCT GTGCTCAGCC 1320

CATCCACCAG AAGA 1334



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GTGGCGGGGA TGCTGCGAGG GGGTCTCCTG CCCCAGGCGG GCCGGCTGCC TACCCTCCAG 60 

ACTGTCCGCT ATGGCTCCAA GGCTGTTACC CGCCACCGTC GTGTGATGCA CTTTCAGCGG 120 

CAGAAGCTGA TGGCTGTGAC TGAATATATC CCCCCGAAAC CAGCCATCCA CCCATCATGC 180 

CTGCCATCTC CTCCCAGCCC CCCACAGGAG GAGATAGGCC TCATCAGGCT TCTCCGCCGG 240 

GAGATAGCAG CAGTTTTCCA GGACAACCGA ATGATAGCCG TCTGCCAGAA TGTGGCTCTG 300 

AGTGCAGAGG ACAAGCTTCT TATGCGACAC CAGCTGCGGA AACACAAGAT CCTGATGAAG 360 
RTCTTCCCCA ACCAGGTCCT GAAGCCCTTC CTGGAGGATT CCAAGTACCA AAATCTQCTG 420

CCCCTTTTTG TGGGGCACAA CATGCTGCTG GTCAGTGAAG AGCCCAAGGT CAAGGAGATG 480 

GTACGGATCT TAAGGACTGT GCCATTCCTG CCGCTGCTAG GTGGCTGCAT TGATGACACC 540 

ATCCTCAGCA GGCAGGGCTT TATCAACTAC TCCAAGCTCC CXAGCCTGCC CCTGGTGCAG 600 

GGGGAGCTTG TAGGAGGCXTT CACCTGCCTC ACAGCCCAGA CCCACTCCCT GCTCCAGCAC 660 

CAGCCCCTCC AGCTGACCAC CCTGTTQGAC CAGTACATCA GAGACSCAACG CGAGAAGGAT 720 

TCTGTCATGT CGGCCAATGG GAAGCCAGAT CCTGACACTG TTCCGGACTC GTAGCCAGCC 780 

TGTTTAGCCA GCCCTGCXXTA TAAATACACT CTGCGTTATT GGCTGTGCTC TCCTCAATGG 840 

GACATGTGGA AGAACTTGGG GTCGGGGAGT GrrGrTTGTCA CTTGGTTTTC ACTAGTAATG 900 

ATATTGTCAG GTATAGGGCC ACTTGGAGAT GCAGAGGATT CCATTTCAGA TGTCAGTCAC 960 

CGGCTTCGTC CTTAGTTTTC CCAACTTQGG ACGTGATAGG AGCAAACTCT CTCCATTCTC 1020 

CAGGTCCAAG GCAGAGATCC TGAAAAQATA GGGCTATTGT CCCCTGCCTC CTTGGTCACT 1080 

GCOTCTTGCT GCACGGGCTC CTGAGCCACC CCCTTGGGGC ACAACCTGCC ACTGCCACAG 1140 

TAGCTCAACC AAGCAGTTGT GCTGAGAATG GCACCTQGTG AGAGCCTGCT GTGTGCCAGG 1200 

CTTTGTGCTG AGTGCTGTAC ATGTATTAGT TCCTTTACTG CTGACCACAT TGTACCCATT 1260 

TCACAGAGAA GGAGCAGAGA AATTAAGTGG CTTGCTCAAG GTCATGCAGT TAGTAAGTGG 1320 

CAGAACAGGG ACTTGAACCA AGCCCTCTGC TCTGAAGACC GCGTCCTGAA TTTCTTCACT 1380 

AGAGCTTCCT CATCAGGTTA CCCAGAAGTG GGTCCCATCC ACCATCCAGG TGTGCTTGGA 1440 

TGTTAGTTCT CCACCCTCGA GGTGTACGCT GTGAAAAGTT TGGGAGCACT GCTTTATAAT 1500 

AAAATGAAAT A 1511 



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

AGGCCTTACT TTTCCTCCCA CAAAGGAGTC GCAGCCACGC TAGCTCTGAC TTGCCACTGT 60 

GACAAAGTTC ACGTAGCAGG TCTAGGCAAA GACTGGGCAA TTGAGCAGAG GAGACGGACC 120 

TGTGAGTCTG ACCRYGAGSC GGRCCCCTTC ACCTTGGCTG GGCTGGTCCT GGTCCTTAGG 180 

TTTTGTCAGG TTGTCCTTGT TTGGATCCCT CAACTAGGTG ATAAGCACTG GAGGGGGATG 240

ACCCGCCTTG GACGTGaTTC TTTAACCTCA TCCATATAAT AGGGCCGTGG GATGGTTGTA 300

GAGGTAAAGC AGGATGATGG TGTTTTAAGA CX:AGAGCTTG GGACCAGGGC TCCTACACCT 360 

AATTTTCTCT CCTGGTAGCT GAACAAAGGT CTAAATTAGC TTAACAAAAG AACAGGCTGC 420 

CGTCAGCCAG AGTTCTGAAG GCCATGCTTT CAGTTTCCCT TGTTGACAAT TGCTCTCCAG 480 

TTCCTATGAA AGCACAGAGC CTTAGGGGGC CTGGCCACAG AACACAACCA TCTTAGGCCT 540

GAGCTGTGAA CAGCAGGGGG TTGTGTGTCT GTTCTGTrTC TCTGCTTGCC GAACTTTCTC 600 

AATAAACCCT ATTTCTTATT TTATATTTAC GTNGGTGCTG GG 642 



(2) INFORMATION FOR SEQ ID NO: 218:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GGTCCCACTG TTCCATTTTA TGCTAATAGA TTCCATTCTA GGGCCCAGCC GTCTCTTGAC 60 

TGATGGTGTT CCCTTTAACC CTTGGCATGT ATAATAGAAT TTTGGTGAAT GAAAGAACCC 120 

AAATAGGCCA GATAGTCCCC CCAGGCCCTG ATATCCATAA AAGGCTTGGG AATGCATTAT 180 

GTAATTGTCC TTAGTCTTTT TGTTGTTTTA GAAAAAAAAA ACAAGATGGG CTCAGATGGA 240

TGCeTACGTA AAAATGGTTC eTASerGTGT AeTCATAAeT ^ll'TCTTrGftA TTGAGTAGTG 300

AAAGGAAGGA GGAGGAAAGG AAATTAAATG TCCTTCTAGT ATTCTCTGGA CTCAAGTCTG 360 

ACATATGRGA TAATAACCTA TATTGAAATG CCAAGAATTG TATCTGAAAC AAGRGAACAG 420 

TTTGACACAT TTATCATGCC TTCATATTAC ATATTAACTG AAACCAATTA ATAAACATAT 480 

GAAATATCCA TTGCACAAGG CAAAGGCACC TAAACCTTTT GTTTCTTTTT CTACATAGCA 540

GAAATTGATl' TTTTTTTTAT TTTTTTAGGG GAACCTATAT AATTATGACC CAGTGATGTC 600 

TTTTGGTGAC TTAAGCTTAT GAATTCAGGT TACAATTGAG TTGATTCTAG ATGGTTACTA 660 

CCTTGAAAAG GATGTTGGTG CCTTATGTGA CACGAGCCAG AGCCTGCTGG GAATAAACAA 720 

AGCAGATTCA TGCCAACACC AACTCCTAGC TTTAGTGGCA GATGGGAGTG GTCACAGACT 780 

CCCAAAATGT GGGGCTTTGG ATTTCCACAC CATCCCACGT GTGTGTCATC TTCCTCTTTC 840

ACACTCTTGA TGATAATTTG AAAATGRTGA AATCACCTCT GAATTTGCCT ATAGCATGAG 900 

CACATTCTTA TGACAACATA ACAAATAGTT CATAATGTGA ATATTAGAAA CTGTTACAGC 960 

CTGCAGTTAC CATAATTTTC CATGTTTGTG GAATTGATAT TGAAATAGCA GGGCTAAGGA 1020

ATTACTGGCA AGTTTTAGCC TGTGGGTAAT ACCTTAGGGT TATTTAAATA TTTGTAATTT 1080 

TATTTAAATG TTCATGAATG TTTGAAAGGA ACAAAATTAT CAGGGATGGC TCTTTGCXAT 1140

GGGTCTTATT TTCACCCTCT TTTCTGTAAG AAAAAAGAAC AATGTCTTAA TGTATTTTTA 1200 

AAGTTrTTGG TATAGTTTCT AATTCCAATT TTAATAAAAG T 1241 

(2) INFORMATION FOR SEQ ID NO: 219:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

TGTTTATGTG ACCTAAAACA TACACACATG CACACACACA TACATATCCA TTCATTCATT 60 

CATTCAAGTG GTGTTTCCAG TGTCTGTGTG TCACTGTTTA TGCAGTTTCC ATTTCCCAGT 120 

GAATTATGAG TGGAGGGCAA CTTTTCTAAC CAGATTGTCT TTTCAGAACA AAGACCKGGG 180 

RATTGAGGAA GAGTTTGGAA AGAGGGAGAG GCAAGGAAAG AGAGCTTTAA ATTGAAAGGT 240

TAATTTCCTA AGAOGAACCT GGGCIX3AATG ACTACAGTGT TATACCCTCC AATCTTTGCA 300 

GGTGGGCATG GAACACTGCT TGTATCACTC TGTGCACGGT ATAAATCCAT ATATCCACAA 360 

AAACACACAT CCATCCATCA ACATATACAT GGTTTGGGAT GAGCAGGTCA ATAGTTTTGA 420 



GAGGGAGTTT GTTCCTTTTT TTTTCTCATT ATACTCTTAA ATTGTTGTCA GTTATCAAAC 480

AAACAAACAG AAAAATTGTT TGGGAAAAAC CTTGCATACG CCTTTTCTAT CMAGTGCTTT 540

AAAATATAGA CTAAATACAC ACATCCTGCC AGTTTTTTCT TACAGTGACA GTATCCTTAC 600 

CTGCCATTTA ATATTAGCCT CGTATTTTTC TCACGTATAT TTACCTGTGA CTTGTATTTG 660 

TTATTTAAAC AGGAAAAAAA ACATTCAAAA AAAGAAAAAT TAACTGTAGC GCTTCATTAT 720 

ACTATTATAT TATTATTATT ATTGTGACAT TTTGGAATAC TGTGAAGTTT TATCTCTTGC 780 

ATATACTTTA TACGGAAGTA TTACGCCTTA AAAATACGAA AATAAATTTT ACAAGGTTTC 840

TGTTTTGTGT GGAAGAGTAA TTGATGTTGC TAAGAATGAT GTTTGTTTTT TTGGGGTTTT 900 

TGTTGTTTTT TTTTTAAATG TTACCAGCAC TTTTTTTGTA AGTTTCACTT TCCGAGGTAT 960 

TGTACAAGTT CACACTGTTT GTGAAGTTTG AATATGAAGG AATAATTAAA AAAAAAAAAA 1020 

AAACCNCGGG GGGGGCCCGG TCCCATTGGN CCCAAGGGQG CGGTTACGGG GTCACGGCCG 1080 
(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

TGAATTGAGG GCTTAAAGAT AAACATATGG GRTTGGAGTT GTGTGTCCAT AGGGTTTCAC 60 

TGCCTATTTG ATTTCAGTTT ATCCCTATTA ATTTTTTACA GTGAAATTTT ATTAAAGTAT 120 

AATGTACATA TATTTTCAGT GGATTTTGCT CTGAAGGTTC TCCAGTGGTC TGACTACGAG 180 

ATAGTGCGGC TTCAGCTGTG GGATATTGCA GGGCAGGAGC GCTTCACCTC TATGACACGA 240 

TTGTATTATC GGGATGCCTC TGCCTGTGTT ATTATGTTTG ACGTTACC/^ TC3CCACTACC 300 

TTCAGCAACA GCCAGAGGTG GAAACAGGAC CTAGACAGCA AGCTCACACT ACCCAATGGA 360 

GAGCCGGTGC CCTGCCTGCT CTTGGCCAAC AAGTGTGATC TGTCCCCTTG GGCAGTGAGC 420 

CGGGASCAGA TTGACCGGTT CAGTAAAGAG AACGGTTTCA CAGGTTGGAC AGAAACATCA 480 

GTCAAGGAGA ACAAAAATAT TAATGAGGCT ATGAGAGTCC TCATTGAAAA GATGATGAGA 540 

AATTCCACAG AAGATATCAT GTCTTTGTCC ACCCAAGGGG ACTACATCAA TCTACAAACC 600 

AAGTCCTCCA GCTGGTCCTG CTGCTAGTAG TGTTTGGYTT ATTTTCCATC CCAGTTCTGG 660 

GAGGTCTTTT AT^TCTCTTC CCTTTGGTTG CCCACCTGAC MATTTTATTA AGTACATTTG 720 

AATTGTeTeC TGAePAeTST eCAGTAAGSArGGeeeATTGT CAeTTAGAAA AGAGAeeTOa 780

AACCCAKGTG CATTTCTGCA TCTCCTGGAT TAGCCTTTSA CATGTTGCTG RCTCACATTA 840 

GTGCCAGTTA GTGCCTTCGG TGTAAGATCT TCTCATCAGC CCTCAATTTG TGATCCQGAA 900 

TTTTGTGAGA AGGATKAGAA ATCAGCACCT GCGTTTTAGA GATCATAATT CTCACCTACT 960 

TCTGAGCTTA TTTTTCCATT TGATATTCAT TGATATCATG ACTTCCAATT GAGAGGAAAA 1020 

TGAGATCAAA TGTCATTTCC CAAATTTCTT GTAGGCCGTT GTTTCAGATT CTTTCTGTCT 1080 

TGGAATGTAA ACATCTGATT CTGGAATGCA GAAGGAGGGG TCTGGGCATC TGTGGATTTT 1140 

TGGCTACTAG AAGTGTCCCA GAAGTCACTG TATTTTTGAA ACTTCTAACG TCATAATTAA 1200 

TCTTGGGCAT CAAGANTAGT TCCAATTTTT TGGGCCGGGG CAGGGTGG 1258 



(2) INFORMATION FOR SEQ ID NO: 221: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1693 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

CACAATATAT GAAATAGTAC CCTCTAAAAA AGAGAAAAAA AAAATCAGGC GGTCAAACTT 60 

AGAGCAACAT TGTCTTATTA AAGCATAGTT TATTTCACTA GAAAAAATTT AATATCAAGG 120

ACTATTACAT ACTTCATTAC TAGGAAGTTC TTTTTAAAAT GACACTTAAA ACAATCACTG 180 

AAAACTTGAT CCACATCACA CCCTGTTTAT TTTCCTTAAA CATCTTGGAA GCCTAAGCTT 240 

CTGAGAATCA TGTGGCAAGT GTGATGGGCA GTAAAATACC AGAGAAGATG TTTAGTAGCA 300 

ATTAAAGGCT GTTTGCACCT TTAAC3GACCA GCTGGGCTGT AGTGATTCCT GGGGCCAGAG 360 

TGGCATTATG TTTTTACAAA ATAATGACAT ATGTCACATG TTTGCATGTT TGTTTGCTTG 420

TTGAATTTTT GAACAGCCAG TTGACCAATC ATAGAAAGTA TTACTTTCTT TCATATGGTT 480 

TTTGGTTCAC TGGCTTAAGA GGTTTCTCAG AATATCTATG GCCACAGCAG CATACCAGTT 540 

TCCATCCTAA TAGGAATGAA ATTAATTTTG TATCTACTGA TAACAGAATC TGGGTCACAT 600 

GAAAAAAAAT CATTTTATCX: GTCTTTTAAG TATATGTTTA T^TAATAAT TTATGTGTCT 660 

GCATATTGCA GAACAGCTCT GAGAGCAACA GTTTCCCATT AACTCTTTCT GACCAATAGT 720

GCTGGCACCG TTGCTTCCTC TTTQGGAAGA GGAAAGGGTG TGTGAACATG GCTAACAATC 780 

TTCAAATACC CAAATTGTGA TAGCATAAAT AAAGTATTTA TTTTATGCCT CAGTATATTA 840 

TTATTTAATT TTTTAGGTAA TGCCTATCTC TTGGTCTATT AAGGAAAGAA GCAATCAGTA 900 



GAGAATTCAG GATAGTTTTG TTTAAATTCT TGCAGATTAC ATGTTTTTAC AGTGGCCTGC 960 

TATTGAGGAA AGGTATTCTT CYATACAACT TGTTTTAACC TTTGAGAACA TTGACAGAAA 1020

TTATGCAATG GTTTGTTGAG ATACGGACTT GATGGTGCTG TTTAATCAGT TTGCTTCCAA 1080 

AGTGGCCTAC TCAAGAGGCC CTAAGACTGG TAGAAATTAA AAGGATTTCA AAAACTTTCT 1140 

ATTCCTTTCT TAAACCTACC AGCAAACTAG GATTGTGATA GCAATGAATG GTATGATGAA 1200 

GAAAGTTTGA CCAAATTTGT TTTTTTGTTG TTGTTGTTGT TTTGAATTTG AAATCATTCT 1260 

TATTCCCTTT AAGAATGTTT ATGTATGAGT GTGAAGATGC TAGCGAACCT ATGCTCAGAT 1320

ATTCATCGTA AGTCTCCCTT CACCTGTTAC AGAGTTTCAG ATCGGTCACT GATAGTATGT 1380 

ATTTCTTTAG TAAGAATGTG TTAAAATTAC AATGATCTTT TAAAAAGATG ATGCAGTTCT 1440 

GTATTTATTG TGCTGTGTCT GGTCCTAAGT GGAGCCAATT AAACAAGTTT CATATGTATT 1500 

TTTCCAGTGT TGAATCTCAC ACACTGTACT TTGAAAATTT CCTTCCATCC TGAATAACGA 1560 

ATAGAAGAGG CCATATATAT TGCCTCCTTA TCCTTGAGAT TTCACTACCT TTATGTTAAA 1620

AGTTGTGTAT AATTGTTAAA ATCTGTGAAA GAATAAAAAG TGGATTTAA/* TTAPAAAAAA 1680

AAAAAAAAAA AAA 1693

(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
ACGCGTGGGT CGACCCACGC GTCCGCGACN TGGCGTGGTG GGGAAGGGAG AAGGATTTGT 60
AAACCCCGGA GCGAGGTTCT GCTTACCCGA GGCCGCTGCT GTGCGGAGAC CCCCGGGTGA 120
AGCCACCGTC ATCATGTCTG ACCAGGAGGC AAAACCTTCA ACTGAGGACT TGGGGGATAA 180
GAAGGAAGGT GAATATATTA AACTCAAAGT CATTGGACAG GATAGCAGTG AGATTCACTT 240
CAAAGTGAAA ATGACAACAC ATCTCAAGAA ACTCAAAGAA TCATACTGTC AAAGACAGGG 300
TGTTCCAATG AATTCACTCA GGTTTCTCTT TGAGGGTCAG AGAATTGCTG ATAATCATAC 360
TCCAAAAGAA CTGGGAATGG AGGAAGAAGA TGTGATTGAA GTTTATCAGG AACAAACGGG 420
GGGTCATTCA ACAGTTTAGA TATTCTTTTT ATTTTTTTTC TTTTCCCTCA ATCCTTTTTT 480
ATTTTTAAAA ATAGTTCTTT TCTAATGTGG TGTTCAAAAC GGAATTGAAA ACTGGCACCC 540
CATCTCTTTG AAACATCTGG 'T7ffiTTTGAAT~TCTRGTGCTC~ATTATTCATI^ 600
TTCATTGTGC TGATTTTTGG TGATCAAGCC TCAGTCCCCT TCATATTACC CTCTCCTTTT 660
TAAAAATTAC GTGTGCACAG AGAGGTCACC TTTTTCAQGA CATTGCATTT TCAGGCTTGT 720
GGTGATAAAT AAGATCGACC AATGCAAGTG TTCATAATGA CTTTCGAATT GGCCCTGATG 780
TTCTAGCATG TGATTACTTC ACTCCTGGAC TGTGACTTTC AGTGGGAGAT GGAAGTTTTT 840
CAGAGAACTG AACTGTGGAA AAATGACCTT TCCTTAACTT GAAGCTACTT TTAAAATTTG 900
AGGGTCTGGA CCAAAAGAAG AGGAATATCA GGTTGAAGTC AAGATGACAG ATAAGGTGAG 960
AGTAATGACT AACTCCAAAG ATGGCTTCAC TGAAGAAAAG GCATTTTAAG ATTTTTTAAA 1020
AATCTTGTCA GAAGATCCCA GAAAAGTTCT AATTTTCATT AGCAATTAAT AAAGCTATAC 1080
ATGCAGAAAT GAATACAACA GAACACTGCT CTTTTTGATT TTATTTGTAC TTTTTGGCCT 1140
GGGATATGGG TTTTAAATGG ACATTGTCTG TACCAGCTTC ATTAAAATAA ACAATA 1196
(2) INFORMATION FOR SEQ ID NO: 223:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
TCAGGGAGGT GGCAGGAAAG GCTTGGAACA GCTGCCGGAG TGACGGAGCG GCGGCCCCGC 60 

CCGGTTGCGC TGGAGGTCGA AGCTTCCAGG TAGCGGCCCG CAGAGCCTGA CCCAGGCTCT 120 

GGACATCCTG AGCCCAAGTC CCCCACACTC AGTGCAGTGA TQAGTGCGGA AGTCiAAGGTG 180 

ACAGGGCAGA ACCAGGAGCA ATTTCTQCTC CTAGCCAAGT CGGCCAAGGG GGCAGCGCTG 240 

GCCACACTCA TCCATCAGGT GCTGGAGGCC CCTGGTGTCT ACGTGTTTGG AGAACTGCTG 300 

GACATGCCCA ATGTTAGAGA GCTGGCTGAG AGTGACTTTG CCTCTACCTT CCGGCTGCTC 360 

ACAGTGTTTG CTTATGGGAC ATACGCTGAC TACTTAGCTG AAGCCCGGAA TCTTCCTCCA 420 

CTAACAGAGG CTCAGAAGAA TAAGCTTCGA CACCTCTCAG TTGTCACCCT GGCTGCTAAA 480 

GTAAAGTGTA TCCCATATGC AGTGTTGCTG GAGGTCTTGC CCTGCGTAAT GTGCGGCAGC 540 

TGGAAGACCT TGTGATTGAG GCTGTGTATG CTGACGTGCT TCGTGGCTCC CTGGACCAGC 600 

GCAACCAGCG GCTCGAGGTT GACTACAGCA TCGGGCGGGA CATCCAGCGC CAGGACCTCA 660 

GTGCCATTGC CCGAACCCTG CAGGAATGGT GTGTGGGCTG TRAGGTCGTG CTGTCAGGCA 720 

TTGAGGAGCA GGTGAGGCGT GGCAACCAAC AGAAGGAGGA GGAGGTGGGG CTGAAGGAGG 780

AGATTGAGAG TGAGGTTGCC AACCTTAAAA AAACCATTAA AGTTACGACG GCAGCAGCAG 840 

CCGCAGCCAC ATCTCAGGAC CCTGAGCAAC ACCTGACTGA GCTGAGGGAA CCAGCTCCTG 900 

GCACCAACCA GCGCCASCCA GCAAGAAAGC CTCAAAGGGC AAGGGGCTCC GAGGGAGCGC 960 

CAAGATTTGG TCCAAGTCGA ATTGAAAGRA CTGTCGTTTC CTCCCTGGGG ATGTGGGGTC 1020 

CCAGCTGCCT GCCTGCCTCT TAGGAGTCCT CAGAGAGCCT TCTGTGCCCC TGGCCAGCTG 1080 

ATAATCCTAG GTTCATGACC CTTCACCTCC CCTAACCCCA AACATAGATC ACACCTTCTC 1140 

TAGGGAGGAG KCAAATGTAG GTCATGTTTT TGTTGGTACT TTCTGTTTIT TGTGACTTCA 1200 

TGTGTTCCAT TGCTCCCCGC TQCCATGCTC TCTCCCTTGT TTCCTTAAGA GCTCAGCATC 1260 

TGTCCCTGTT CATTACATGT CATTGAGTAG GTGGGTAGCC CTGATGQGGG TCGCTCTGTC 1320 

TGGAGCATAA CCCACAGGCG TTTTTTCTGC CACCCCATCC CTGCATGCCT GATCCCCAGT 1380 

TCCTATACCC TACCCCTGAC CTATTGAGCA GCCTCTGAAG AGCCATAGGG CCCCCACCTT 1440 

TACTCACACC CTGAGAATTC TGGGAGCCAG TCTGCCATGC CAGGAGTCAC TGGACATGTT 1500 
CATCCTAGAA TCCTGTCACA CTACAGTCAT TTCTTTTCCT CTCTCTGGCX: CTTGGGTCCT 1560

GGGAATGCTG CTGCTTCAAC CCCAGAGCCT AAGAATGGCA GCCGTTTCTT AACATGTTGA 1620 

GAGATGATTC TTTCTTGGCC CTGGCCATCT CGGGAAGCTT GATGGCAATC CTGGAAGGGT 1680 

TTAATCTCCT TTTGTGAGTT TGGTGGGGAA GGGAAGGGTA TATAGATTGT ATTAAAAAAA 1740 

AAAAGGTATA TATGCATATA TCTATATATA ATATGACGCA GAAATAAATC T 1791



(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

ACACTAGTGG ATCCAAAGAA TTCQGCACAG CGGCACAGCA TTGTTGAGCT TTTCTGTGTG 60 

TGTGGGGCCC TCAAGCGAGC TCGACTGGTC CATCCTGGGG TAGCGASGTG GTGTTTGTGA 120 

AAAAGGACGA TGCCATCACC GCATAYAAGA AGTACAACAA CCGGTGTCTG GACGGGCAGC 180 

CGATGAAGTG CAACCTTCAC ATGAATGGGA ATGTTATCAC CTCAGACCAG CCCATCCTGC 240 

TGCGGCTGAG TGACAGCCCA TCAATGAAAA AGGAGAGCGA GCTGCCTCGC AGGGTGAACT 300 

CTGCCTCCTC CTCCAACCCC CCTGCCGAAG TGGACCCTGA CACCATCCTG AAGGCACTCT 360 

GAAGTGCTe AGGGSGGTGT KTGAeeAGGC AGeeGAeAGA WTTeAAAATG AAGCTTTGAG 420

CAGGQGAGTR 7W3GCAGCCAG AAGTGGGGGC AGAGGAGGGT GGCTCTGTTT CCCCAAGGCA 480 

AAGCTTATGA CCAATGGGCC ATCGGACTGG AGACCCCTGA TTGTGGGAAG GGTTGCCAGG 540 

GATAAAGAGC TTCCTCACTG GATGGGACCC GCCTTTCTGT GTTGTGTTCT GCCCTGTGCT 600 

CTTCTCTCTA CGTTAACGTT TCCTGTAGTA TGTTTCTTCA TCTCATCGCC AAGGTAGGCT 660 

TGTGTTTTTM AGTGTGTGCC TCCCCGAGCC TCAGCCCCAA GCTGATTTCT TATCTGGAAA 720 

TGGTACACTG AATTCTCTGG GTGGCTTTCT TGTGGCCCCA TGGGATGCAG CGTGGGGGCT 780 

GTCTGAAGGA CCCTGCTTTT TCCAGGGGCC GAGGGGCTGC CTTTCCTTTG TGTGTATTAA 840 

GCTTTTCAAA CAATGGAGGG GATGGAGAGC CCTGGTGTCC TGACGGGAGC CAGGTCQGCC 900 

TGAGAGCTGT GCCGCTCCTC TGTCTTGTCA GTGGAGGTGC CTGGGTGGGG AGCAGGTCTC 960 

AQGCCTCTTG TCCTCTCCCC AGTGGCTCCA GGCCTCACTA GTGGCAAGGG CAGGATGAGG 1020 

CTGCACCGCT GGGAAGAGTC TATCTAAGCT CTTGGCTTGG AGTCCCGTGT CGTCTCCRCC 1080 
CAGAGGAAGT TCTCCAGAGT TCACCTTTCC CTTTTCCTTG AGTTGTGCTG AATGCCCCAC 1140
CCCAGCTCTC TTTCCCTTCT GGGTGTCTTT GCTGGGAGGG GGCTGTGTTC TGAGCCCTCC 1200
CXX3TTCTCAC CTCGCCTGGC ACTTAACCAC ACCCTGGTTT TGTGTAGCCG CCAGCTCTCT 1260
TCTGGTTGGG CCTTTGAAAG GCTCAGCCTC CCATTGTOCA GTGCTTGGGT TTGGAGCTTA 1320
TTTGAATGGA AGAGGTCAGT TTGTTCCTGG CTCTCCATTT CTGGCCTCAG TTGTCTACAG 1380
GACAGTGGTC AGGGATGCCT GGAGGCATAT ATCCAGCTGC CACCAAGGGG CACTGTTTGT 1440
TCCCACTTAT GTGAGTGACC CCAOXrCATCC ATGACCAGAG GATTATTTTC CTGCCTTGGC 1500
AGAGGAGGAG GAGTCAAGGG AGCAGGGCAG CTCTACCAGG CAAGGTGTTT CCCCAGCATA 1560
GGCGCAGACA GTTGGGACX5A AACTTCAGAG CCCAGGCAGT CCCTGAATGA CCAGGCCAGT 1620
GTTGTCACTG AGTGGTCCCC TGCTGGTTGG GAGTGAAGAG AATCCAGGCT GGCAGAGCTG 1680
GAGCCAGTTG GGGAGCACGG TTCTGGGAGC TCTGCAAAAT CAGTAGCAAG TGCTGGAAAA 1740
GGCACATGCC GAAGATACTC AAGAGCTCCC AAGATTTGCT TGAGGCTAGC CCACTGAAAA 1800
AAACCAGAGA CrrCATGTTTC CAGGGGTCAG TCTGTCAGGC AGGAAGGACC CAGGATTTGA 1860
ACCCAGCTTC AGTGTGCAGG CTCTGAGGCT GCCCAGGACG GGAAAGTCCA AGGAAGGGGC 1920
CTGGTGGTGC TCCACTTGCA GTTCTTTAAA GAATGCTGCT TTTTATTCTC CTAACCCTTT 1980
CAAGTGGGTG CAGACTTCTC GTTAGCAGCT GGAAGACATT CCTCCCACAC TTTTCCCTTC 2040
CTGGCCCAAG AGAGCATCCA GAAGGCAGTA GGACCTGGTT TTTCAGGTAC TGGGAGCCGG 2100
GGGCTCACTG CTTGCACTGT GCTTAGGGTA GGGATGGTAA ATATCCTCCC TGCATGGCTT 2160
TATCCTCCCT CTCATCCCAA AGCAGGTATC TTCTGGTTGT CACAGAGTTT CATTGAGTCC 2220
AGCTGCAGCC ACXSTGGCCAT CTGGAGCTGG TGCTATAGGT GACCATCTGG TACATTGAGG 2280
GGACCTGTTT GCCTCCTCCA CTCTATAAGC AGTCATCTTG GGAGACCGGG AGGAGAAGGT 2340
GGTGGGCTAG TCCTGTGTCC TCCTCCACTT CCCATGCCTC TATGTTACCC ATCTGTGTCT 2400
CCTGTGCAGA AGGAGAGGAA GGGGCATTAA GAGATGAAGG GTGATTATGT ATTACTTATC 2460
CATTTCTGAA TAAACATTTG TTATTCCTAA AAAAAAAAAA AAAAAACTCG AGGGGGG 2517
(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

TTGTANCTAA TCGAGGATTG ATTCTAATGA CAGAGTCTTT CAACACTTTG CACATGATGT 60

ATCACGAAGC TACAGCTTGC CATGTGACTG GAGATTTAGT AGAACTTCTG TCAATATTTC 120 

TTTCGGTITT GAAGTCTACA CXKTCCTTATC TTCAGAGAAA AGATGTGAAA CAAGCATTAA 180 

TCCAGTGGCA GGAGCGAATT GAATTTGCCC ATAAACTGTT AACTCTTCTT AATTCCTATA 240 

GTCCTCCAGA ACTTAGAAAT GCCTGTATAG ATGTCCTCAA GGAACTTGTA CTTTTGAGTC 300

CCCATGATTT TYTTCATACT CTCGTTCCCT TTCTACAACA CAACCATTGT ACTTACCATC 360 

ACAGTAATAT ACCAATGTCT CTTGGACCTT ATTTCCCTTG TCRAGAAAAT ATCAAGCTAA 420 

TAGGAGGGAA AAGCAATATT CGGCCTCCGC GCCCTGAACT CAATATGTGC CTCTTGCCCA 480 

CAATGGTGGA AACCAGTAAG GGCAAAGATG ACGTTTATGA TCGTATGCTG CTAGACTACT 540 

TCTTTTCTTA TCATCAGTTC ATCCATCTAT TATGCCGAGT TGCAATCAAC TGTGAAAAAT 600

TTACTGJ^C ATTAGTTAAG CTGAGTGTCC TAGTTGCCTA TGAAGGTTTG CCACTTCATC 660 

TTGCACTGTT CCCCAAACTT TGGACTGAGC TATGCCAGAC TCAGTCTGCT ATGTCAAAAA 720 

ACTGCATCAA GCTTTTGTGT GAAGATCCTG TTTTCGCAGA ATATATTAAA TGTATCCTAA 780 

TGGATGAAAG AACTTTTTTA AACAACAACA TTGTCTACAC GTTCATGACA CATTTCCTTC 840 

TAAAGGTTCA AAGTCAAGTG TTTTCTGAAG CAAACTGTGC CAATTTGATC AGCACTCTTA 900

TTACAAACTT GATAAGCCAG TATCAGAACC TACAGTCTGA TTTCTCCAAC CX3AGTTGAAA 960 

TTTCCAAAGC AAGTGCTTCT TTAAATGGGG ACCTGAGGGC ACTCGCTTTG CTCCTGTCAG 1020 

TACACACTCC CAAACAGTTA AACCCAGCTC TAATTCCAAC TCTGCAAGAG CTTTTAAGCA 1080 

AATGCAGGAC TTGTCTGCAA CAGAGAAACT CACTCCAAGA GCAAGAAGCC AAAGAAAGAA 1140 

AAACTAAAGA TGATGAAGGA GCAACTCCCA TTAAAAGGCG GCXSTGTTAGC AGTGATGAGG 1200

AGCACACTGT AGACAGCTGC ATCAGTGACA TGAAAACAGA AACCAGGGAG GTCCTGACCC 1260 

CAACGAGCAC TTCTGACAAT GAGACCAGAG ACTCCTCAAT TATTGATCCA GGAACTGAGC 1320 

AAGATCTTCC TTCCCCTGAA AATAGTTCTG TTAAAGAATA CCGAATGGAA GTTCCATCTT 1380 

CGTTTTCAGA AGACATGrCA AATATCAGGT CACAGCATGC AGAAGAACAG TCCAACAATG 1440 

GTAGATATGA CGATTGTAAA GAATTTAAAG ACCTCCACTG TTCCAAGGAT TCTACCCTAG 1500

CCGAGGAAGA ATCTGAGTTC CCTTCTACTT CTATCTCTGC AGTTCTGTCT GACTTAGCTG 1560 

ACTTGAGAAG CTGTGATGGC CAAGCTTTGC CCTCCCAGGA CCCTGAGGTT GCTTTATCTC 1620 

TCAGTTGTGG CCATTCCAGA GGACTCTTTA GTCATATGCA GCAACATGAC ATTTTAGATA 1680 

CCCTGTGTAG GACCATTGAA TCTACAATCC ATGTCGTCAC AAGGGATATC TGGCAAAGGA 1740 

AACCAAGCTG CTTCTTGACA TTAGGTGTAG CATGTCTACT TTTAAGTCCC TCACCCXXAA 1800

CCCCCATGCT GTTTGTATAA GTTTTGCTTA TTTGTTTTTG TGCTTCAGTT TGTCCAGTGC 1860

TCTCTGCTTG AATGGCAAGA TAGATTTATA GGCTTAATTC TTGGTCAGGC AGAACTCCAG 1920 

ATGAAAAAAA CTTGCATCTT CAGTATACTT CCTAAAGGGC AATCAGATAA TGGATATGTT 1980 

TTATGTAATT AAGAGTTCAC TTTAGTGGCT TTCATTTAAT ATGGCTGTCT GGGAAGAACA 2040 

GGGTTGCCTA GCCCTGTACA ATGTAATTTA AACTTACAGC ATTTTTACTG TCTATGATAT 2100

GGTC3TCCTCT GTGCCAGTTT TGTACCTTAT AGAGGCAGAT TGCCTCCGAT CGCTGTGGTT 2160 

CTTATTATCA AAATTAAGTT TACTTGTATA CGGAACAACC ACAAGAAATT TGATTCTGTA 2220 

AAGAATCCTC TTTAGCTGTG GCCTQGCAGT ATATAAATGG TGCTTTATTT AACAGAATAC 2280 

CTGTGGAGGA AATAAAGCAC ACTTGATGTA AAAATAATTG TTTTATTTTT ATTGACATGA 2340 

CTGATTGATT GCTATTCTGT GCACTNAATT AAACTGATTG TGATGACTTA AAAAAAAAAA 2400

AAAAAAAAAA AAAAAAAAAA AAAA 2424 



(2) INFORMATION FOR SEQ ID NO: 226:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1080 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

ATATAGGACG"GMAA^^ TGCPXrrCA<^'^ 60 ' 

TAGGTGACAA GAAAACAAAG ATCTTATTCA AAAGAGGTCT TACAGCAACC CAACGTCTCA 120 

TCTTCCCATA GTAAAGATGA CGGCGCCTTG AGGTAAGCTA CAGGCAACAC CACTTCCGCG 180 

TTTCTCTTGC GCCCTGGTCC AAGATGGCGG ATGAAGCCAC GCGACGTGTT GTGTCTGAGA 240 

TCCCGGTGCT GAAGACTAAC GCCGGACCCC GAGATCGTGA GTTGTQGGTG CAGCGACTGA 300

AGGAGGAATA TCAGTCCCTT ATCCGGTATG TGGAGAACAA CAAGAATGCT GACAACGATT 360 

GGTTCCGACT GGAGTCCAAC AAGGAAGGAA CTCGGTGGTT TGGAAAATGC TGGTATATCC 420 

ATGACCTCCT GAAATATGAG TTTGACATCG AGTTTGACAT TCCTATCACA TATCCTACTA 480 

CTGCCCCAGA AATTGCAGTT CCTGAGCTGG ATGGAAAGAC AGCAAAGATG TACAGGGGTG 540 

GCAAAATATG CCTGACGGAT CATTTCAAAC CTTTGTGGGC CAGGAATGTG CCCAAATTTG 600

GACTAGCTCA TCTCATGGCT CTGGGGCTGG GTCCATGGCT GGCAGTGGAA ATCCCTGATC 660 

TGATTCAGAA GGGCGTCATC CAACACAAAG AGAAATGCAA CCAATGAAGA ATCAAGCCAC 720 

TGAGGCAGGG CAGAGGGACC TTTGATAGGC TACGATACTA TTTTCCTGTG CATCACACTT 780

AACTCATCTA ACTGCTTCCC CGGACACCCT CCACCTCTAG TTGTTACTAA GTAGCTGCAG 840 

TAGGCATTGC TGGGGAAGAA ACAAACACAC ACCAAACAGT ACTGCTACTT AGTITCTAAG 900

GCTGCACAGG GAAGGGA/^ ACTGGGCTTT GGACAATCTA GAGGTAATTT ATATCCGCCC 960 

CCAGGTGGAG CAACATGCGA TTCTGGAGGC ACGGGGGTAA CTGAAAGTGA CTACATATAG 1020 

TCTTTCTGGT TTCTGGAGAT AACCCATCAA TAAAAGCTGC TTCCTCTGGG TAAAAAAAAG 1080 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

TTGCATTCAC AATTACTGGG AGGCAGGCAG GGGCAGTTGC ATGCTGGGGG TGGCTGCATG 60 

GSCTGCCASC TCTCCTGGGT TTGAAGGATG CGGTACASCT GCTTCAGCTG AGCAACGATG 120 

TTATCCTTGA TGTCTGGGGT TGAGATCTGC AGGCGGACAC TGCCACTATC AAAGGATCGT 180

GTGAAATCAC CAGAAAACAT CTCGTAGATC ATCCGAGCCA CTACTGGAAT GACCTGAACC 240 

AAGATGAGTT TCCTTTCCAA TGGTTTCCCA TCTGGCCATT CTTCCCCAAA GCATAAGTAG 300 

ATCTCAAACG GTGGCTGCTT CTCTATCTGT CCTTTCTGGT GGGCAATGAG ATCGCTAAGG 360 



AATGTTTCCA GACAAAATAG CTTGACCTTC TTTTGTCTCT CAATCAGGTT GGGAGCAACA 420 

AGTGATGGGG CACATGGCCC AGACCAGTAC ACCTTGCACT GGCACAGYCT GATGGCATAA 480

ATGGCATGAC CGCTGACCTC CAGGATCAGT CCTCTGTCCA TGACGTCCAG CAGCTTGCTA 540 

GTGAACAGCT TCTGCTTCTC ATTGGTAATA TGCTCAGGAC CTGGGAATTT GACCTGCTCC 600 

AGWCTGACGG GACCAAAGAG CTCCTCCTGG TCAGGCATGG GACCCAGGTC CCCATAGAAG 660 

AGTCGGCAGC CCTGAGGGTT GCTCACGGTC ATGGTCCTGC CCGTACTCCT TCCCACGGTA 720 

CTGAAACTTG ATGTCCAGGfT CAGTCATTGG GAGAGAGCTG ATCCACAGTT CTGGAGAGCT 780

ATAGAAGGRC TGTATAGGTG CCTGGGGWAC TTCCATCTCC AGGQGTTCAG TTTTGGGCCA 840 

CACTGCCTCC GGSCTGCAGT TGCCCACACT GCAATTGCCC ACACTGGCTG GCGCCATGGG 900 

AGAACCATTG ATGTTCAGGA AGGGGAAGGT GTCCTGGATG GGAACATGGT GCTGCGACTG 960 

ATCCAGCTCA TCTTCCTCAT CTTCTTCATC CACATCATTA TCCTTCTCAT CCCAGGGAGC 1020 

AGACCCTGTG GATCCTGGGT TAATGATCGA SCCCTGQGGC TGAGGGATGT CACACACTTG 1080

ATATATCTTC ACTGGGTTCA TGGGCACXTC CCTTGGTGCC ATCCATACAT CCAGGTTGAA 1140

TTCTCTGCTC TTATTGAGAG CACAGCGCAG CTGGGCCTTC CATTTAGCTG GGTCAGGGTC 1200 

ATCCACCCCT TCCTGGTACT TCCCTGTCTC TACAGCCCAG GCCTTAAAAA TGGTATTTTC 1260 

CTCTTCTTGT TGAGGGCTAT GCCGGGTGGC ATGTTTCCAG GGAATCTGGA AGCGTTTAGA 1320 

GTCCCTGTGT AGCCAG 1336



(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2043 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

TCAGCTGGTC CCTTCCTTGT GTCCTGGGGG ACCTGCTGGC GGCCTCTTCC TGGGAGCCAT 60

GACCTCAGAC CCCACCCACA CTCCAGATCG AGACCCCTGC CTCCCCCCGG CAAATGTCCT 120 

CCCGCTGCCT TGCAGCCTGC ACTTTGCACA TGCTCACCCC CAGCACAGTC CCACTGGCCC 180 

CTCAMCTCCC CTTCCCTGAG CTCCTTCCCA AGGACTCCTG GTCACTGCCT GCTGTGCAKT 240 

CAGAGGCCCA GGGTCCAGCA GCCCGGSGGG AACGGGTGCT GCCTSTTCCT CCAGTTAGCT 300 

CCAGYTCAGG TCTGAGACCC GTGYTGAGTA AAGGTCTGAG CAMCGACCGT GCCCTCTGCC 360

CAGGGCTCGG TXXTTGAGCAG CTGGTTTTCC TGCAGGAAX5G TTGGAGCAAG CAAAGTCCTT 420

CTCTGCCCTC AGGGTCAGCT GCCCAGACTG GGGCGGATGC AGAGAGGCAG GTGGGCTGTG 480 

GCTGGACTGG TCCGGAGCTG GCTTCCTTAC CAGAAAAGCC TCAGCCTTCC TCTGGAAGCA 540 

TCCCCCGTTC TGGGCAAGGG GGAAGGGCTC CTTTAAGGGG TGTGCTTTCC CAGTGGGGAG 600 

CAGTCTGGCC CTGCCCCCTA CTAAAGCCTC TGCTCTCAGC ACTTTCCCCC AAGTCCTTGT 660

AACTTGCTTG AAGGTGGGTT CTGGCTGCCA GCCAGTCCCT GGACAAACTC TCCTGCCCCT 720 

TTTAAATTTC ACTCATTTTG TATAAACCCA GCAGGCTGGT GTTTACTTAG CCCTGTAGCT 780 

TTTTTCATTT TTTCTTTCCG TCTTTCTTCT TGAGTTCACG GTTCAATATT GCCTCCTCGC 840 

CCTGGTGAGG GGAGGTGCTG CTTTTCTGCC CCACCTGCCG GCTGGTTCCA GCAGCGCTGG 900 

NGCCCAGCTG GGGGGCCGGG ATGGGGGCTT CTCTCTCTGG GAGGGGTGCA GGTGCCCTCC 960

CCAGGCTGGG AGGGTTCCTT CCCTAGCTCC CCATCTGCCC CCGCTGGTGA GAGTTGGGCT 1020 

TCTTGGTCTT GGAACTCCCT GGCATTGGGA ACAGAGCATT TCCAGCATTT GTTGTTGTTG 1080 

GTGTTTGCTC
ATACTGGCAC
GATGGGGAGT
GTGCATTCAG
TGCCACATCC
ACCAAATTGA
GAAAGGAGTT
TAGGAGATTT
ACCGCACAAA
GTCTTTAGGG
TTTCTAACAG
TTCTGTTGTA
GGG

CTAACCCTTTA GAAAATGAAT GTTAGAAGGT GCCTGCCGAG GCGGGACAGA 1140 

GCGCTGGAGA AGGCTCTGCT CAGCCXTTGAG AGTCCCTTCC TGCCXXZACCG 1200 

TTTAAAAAGG AAGCTGACCG CACAGTGTCC AGACGAATTG GCCCCCAGAA 1260 

TCTGTCCTGC CCTTCTGTGT CTGCGTGACC TCACCCAGCX: TAGGAGGGAG 1320 

GGTAGATTTG CCTCTCATTC AAAGTTCTGG GGCTrTGGGY GGAAAACAGC 1380 

GCTGTTGGGG AGACTCCTCC AGACCAGGAA CCCCAGAAGG AGACAGAGCC 1440 

TCCCACGCCA GGCCXTGGGC CAGGGTGATT GGACTGAGAA TTTGGCCACA 1500 

TGCTGGCTGG AACCAGAGGC CAGAAAGCCT GGCCTTGTCC CCATGTGGGA 1560 

CAGCCCTCTT GTCCXTCTTGA GCTCAGTGAA TTCCCACCAG GTGCCCACAG 1620 

TCAAATTCTA TATATTGAGA GAGTTGGAGA GTATATCAGA GATATTTTTG 1680 

GGTCTATGCA ATGTCAGTTT GGAATCTTCT TGAAAGTTTA ATGTTTTTAT 1740 

AAAGAAAATA AAGGTCTACA ATATCTTTAG G' l *' in T r i'J''X'r TTTCCTGTTT 1800 

CTGACCACAT GGCATGTCTA TCAGGATGGA GGGTGTCCAT GTTCTCCTCT 1860 

AGGTGATAAG GAGATGGSCG RAGGGGTGTT TTTTTCTTTG ACTCCCCTCC 1920 

AATGTTGCCA CCACTGCTTG AGTGGGCTGT GTTTGTTCCT CTGTCCCAGC 1980 

GAAAATAACA TTGTTAGGGG AACTCAGGCT AGTGTCAGCG TCTTGGTTTG 2040 

2043 






(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

TAAATVAGAAG CGGGAGAATC TGGGCGTCGC TCTAGAGATC GATGGGCTAG AGGAGAAGCT 60

GTCCCAGTGT CGGAGAGACC TGGAGGCCGT GAACTCCAGA CTCCACAGCC GGGAGCTGAG 120

CCCAGAGGCC AGGAGGTCCC TGGAGAAGGA GAAAAACAGC CTAATGAACA AAGCCTCCAA 180

CTACGAGAAG GAACTGAAGT TTCTTCGGCA AGAGAACCGG AAGAACATGC TGCTCTCTGT 240

GGCCATCTTT ATCCTCCTGA CGCTCGTCTA TGCCTACTGG ACCATGTGAG CCTGGCACTT 300

CCCCACAACC AGCACAGGCT TCCACTTGGC CCCTTGGTCA GGATCAAGCA GGCACTTCAA 360

GCCTCAATAG GACCAAGGTG CTGGGGTGTT CCCCTCCCAA CCTAGTGTTC AAGCATGGCT 420

TCCTGGCGGC CCAGGCCTTG CCTCCCTGGC CTGCTGGGGG GTTCCGGGTC TCCAGAAGGA 480

CATGGTGCTG GTCCCTCCCT TAGCCCAAGG GAGAGGCAWT AAAGACACAA AGCTGOAAAT 540



(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

AATTGTGAAA TATTAGAATA TTGTTACTAT TTGACCCAAC TCAAAATCTC CATGGGAAAA 60 
TACCTGTCGA TACCCACAGT ATTGTTGAAA ATAATCAGAT GCAGTATCAC AGCTGTGTCA 120 
GACTCTAGTA CCAGTTGGGC AATCAAGGCA CAGCTAAAAA TTGAAAACAA AGATCTGGAC 180 
AACAAAACAG CCAAAGGTGG GGGTCAAGAA GCTCTGACGT GTACCTAGCT GTAGAATGCT 240 
ATGCACACGT GCCAGGTGTA GTGTGCATAT CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT 300 
CAMCTCTGGT TGACCATGAG CTCTGTGTAA GCAGGAAGTG AAGGCTAAGG CAGATTTAAG 360 
CTCTGAAAGC ATTCCACAAC ATACACACAA ATCGTGCAAA GCATTAAGGA AATCTTGTTA 420 
CTGCTAAGTG TTGCTGACCC AGGAACAA 448 

(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

GTATGCTGCC CCAAACCAAT ATGTGTGGCT GCCTTTWACC TGACTTCTCC AACATGTAGC 60 
CCCAAGAGGA GGCCTCTAGA CTRAGGGAGG GGCTGGTGAC CCAGGTGTGG TGGGGCTGCA 120 
TGARACTACC AGAGAGACAG ACATTCTGGA ACTCACCCTG GGGGATCCAG TGGATCTGCC 180 
TATGGTCTGG TCCACCCCAG ACCTGTGAGA TGTTCCTCAT GAGGATGCAC TTGTGCTTCT 240 
GCAAGTATTG CTGCAGCTTC ATAGTGACTC CCACCAGCAC CAGCAATACA GYTAGCTACC 300 
TGTGGCCTTG GATCTCAGCC AGCATGGCTG GGAGAGGGAG CARCTGGGCA TGTACCCTAA 360 
ATGCTGTTAC CAGGGAAGGA CTCCCAGAGT GAAGACAAGT AGGGACT 407 



(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
GTATTTGATT TCAGGCTGCT AAATGGGCTC ATTTAGCATT CATTCCTTGA TGTAGACATT 60 

AAAAAAAAAA CTGAATAGCA TTCTTTCCAG GOTAACTAAT AAAGCAGACA TGCTAAGCCT 120 

ATAAATACAT CAGCACTGCA GCACACGTTT AAGGTTGCCA CGGACAAGGA TCACACAATA 180 


GAGAACACTG TAGTTCGGTC TGCTCACAAG ACCCAGAACA TTGATCAGTT TTTGTTGTTG 240 

GTTTATTATT TTTCTGTTAA AAAATTGTGA AAAGTTTGTT TTAGCTAGAT GATATTTTAA 300 

TAGCTGCGAG TGCTTTGGAA CTATAAAGAT GTCACTACTT AACACACATA CCTTATGTTT 360 

TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG TTAGCCAACC ATCTAGCATT 420 

TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGATGT TCCCATAACA GCAACAAAAC 480 


AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTGT AAGCACCTGC TTAACTTTGT 540 

GTNCCAAATA TTTAGTGTGT ATATATATAT ATATATACAC ACACACACAC ATATATATTC 600 

AACAAATAAA GCAAAATATA ACATGCATTT CACATTTTGT CTTTCCCTGT TACGATTTTA 660 

. . . „ . AT AGCAGAA C TGTM^GACAA.^TTrAGG^^ 1.20 

AAAACAGATT AACAACAACA AAGAAACTGT CTATTTGAGT GAAGTCATGC TTTCTATTAT 780 

AATAACTTGG CTTOGGTTAT CCATCAAATG CACACTTATA CTGTTATCTG 830 




(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 


CCAGAAGAAA GACCAATCTA GAATATGGAA CTCTAATCAC TTCTAGTATT TCAACTTCCT 60 
AGCAGAAATG AACTTGGCCC TAGACCTAGG GGATAAGCAA TGTTCTTTAT GTAGCCAATG 120 
CTACGGAAAC AAAAGAGGTG AAAGAGACCC TTTTTTTATA CTTAATGTAC ATATATTGAC 180 
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TITTTGAGCA AGAATGCCAG AAATAGCCTT CATTTCTACC CTGCAAAATA ATCCAGATCT 240 



GCTTTCTAAA ATGRANTCAG TTTCTAAAGT GAAACATGCA ATATTTATGC TCTGACTGAC 300 


TCCTGAATTG GARGAGGAAG RACTTCTGTT TACAGAAAAC YGTATTGTTA TATATGTCAG 360 



GCTGTGTATT GTGACTATCA GCATTCTGGT GCAAATGAAC TTTTCTCCAT CATCGACTGT 420 



GGAAAATTGA TACTTTTAAA GCATATTCTT CTATGAGCAC AGGTCCTCCT AGTGAAACTT 480 



AATTTGACAA AGGGTGTCAT ATGCTTTCCT AACCTGAWTT GTATTAACAT TCACAGAGCC 540 



TACATTTTCT CATTAGGGTT RTGATGCTCA GTATCTTTCC AAGTGCCAGG CAGRGCTTNC 600 


CTTTTCTGAT CAAACATACC ATTTTTTGTA TTTCACAACT ATAGACAGTC ACTTCTGCAG 660 



TCCCAATTTA AAAATGCAGA ACTGCTTTAT CCAAGAATGC TGAAAAATAC TGTTCTATCC 720 



AGGTTTCCTA AACTATAAAA GCAGATTTTG CTTTTGTTTG TTAATCATAG GCATGGCCGA 780 



GCATTGTGGA TTAGCCTGAG GCTTAAAATC AGATGCATGT CTGGTAAGAT GACCACTGTC 840 



TCACTATCAA GAGCCTGCAG AGCCATTTTC CAGACCTCTG ATTGCCCAGA ACACATAGTC 900 


CCCACGITTC TAATTTGGAG CAAATCTAAA AG 932 




(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2786 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 


TTAGCAGGGT GAGCTGTTAA AACAGCACAC ATCTCTCATC CCCTCTTCCT TTATTCCCCC 60 
CTGGGTTTCA GAAAGGAAGG ATATATGGGG ACCACCTCCC CCTTCTTTGA TCCCAGCATC 120 



TCAGTCCCCC TCCCAACCCT CCATATGGCT CTCAATGGTG CTCACTTGCT TGGAAGCAGG 180 



CTCCCAATAG GGAGGGGSCT GCCCTCTACA GTCTCTTTGA CTGTAAGACA GGGCTCTGTA 240 



TCAGTGAGAC GATGAGAAAA GTCCCAGGCT AATGGCAGAA ATTTGCACTT TGAACATGTG 300 


TGTTTTTGTG TTGTGGAACC TGAGATTCCT TATTTATTAA CAGGAAGTCT GATTTTTTTT 360 



TTTTGGAGTC TTTGITGCTA TATTTTGTGG QGCTGGGAGA GAGAGATTAG ATTATTTTGA 420 
CATGGGATCC CTTCCATAAC AGGTACTTTG AAGGCAAGAC ATAGGGTTGA AGAAGCACAA 480 



CCAGCCTCTG AAATCATAGC TCTCCAGTGG CTTTTAAAGA AAGCTGGTCC TCAGCACTAA 540 



CAAAATCACT ACAATAGCCT AGTGCTTTTT TGGAAGCCTT TTTAGGGAAG AATGTTAGGT 600 

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TCATGGTAAC TAGTATGCTC TTTGAGATTT TTACAGTGTT GAAACTTAAG AATTTTGAGA 660 

GGGTGAGGAG GGTTGTTCAG AATCTAAATT ACAGATAGAT GATTGTTTCT TGTGAATTTG 720 

TTTCTTTTCC TTTTTTTTTG TCCCTACCAT TTCCTTACAT TTCCCTTGGG GCCCATCTCT 780 

GGCTCCTTGC TOTITGTTTC TTGCTTTGCT TTATCAGTTC ATTCCAGCTC CCTGTTAGTG 840 

AAGGACACTG CTGTTAGTGA AGGAACAAAG TCTATGAGTC CTAAAATTTT AAGTCAAAGA 900 


AAACTGCTCT GTTTCCCCTT TAGTAACACT TCTGAAGAGG AAAAACTTCA ATAGCCAAAG 960 

TTAATAATCC TATATAATAA TTGCTTTGGC TTTCACCTAA AATTCTGGGC ATCACAATTT 1020 

CXnTGGGATA GAGGTTGTGT TGGGGAATAG ATTGCTTATT GCTGTTCACT GGAGAGAAAA 1080 

GGTAGTCTTT TTGTACAAGG TCATACCGCC AGAAGCCCCA AATCCTATTT TGGCTCATCT 1140 

TCAGGTAAAG AGTAATTCCT ATCCTGTGTG CCTCAGAAGC TAGAATCGAA GGCTTACCCT 1200 


ATTCATTGTT TATTGTCAGA AATGCATGAT GGCTCTTGGA AAGAATGACG TTTTGCTGGA 1260 

AAAAAAAAAA AGAACAGTTT GTGTTTCACA AACATGGCTT ATCAATTTTT TCAAAGAATT 1320 

CTTTTTTCCC AAAAAGAGGA GTAACAAAAT GTCATTTCTG AAAGAGGCTT ACTTTATACC 1380 

AACTAGTGTC AGCATTTGGG ATGCCAGGGA ACAGAGAGTG AGACACCTAC AATCACCAGT 1440 

CTCAAATGCG CTATTGTTTC TTTTCAGAGT GTTGCAGATT TGCCATTTCT CCATAATATG 1500 


GGGATAGAAA ATGGAATAAA GATAGAAGGG ATGTAGAATA TGCTTTCCTG CCAACATGGT 1560 

TTGGAGTCGA CTTTGGTATA TTGACTAGAT ITXSAAAATAC AAGATTGATT AGATGAATCT 1620 

ACAAAAAAGT TGTCCTCCTC TCAGGTCCCT TTTACACTTT TTGACTAACT JVGCATCTATA 1680 

_ _ _ ^ TTCCA CACTT AGCrrrmG TCACACTTAT CCTTTGTCTC CXTTAAATTTC ATT TGCA GTG 1740 

GTTAGTCATC AGATATTTTA GCCACCTACA CAAAAGCA7A CTGCATTTTT AAAAATCTTT 1800 


CTGAGATGGG AGAAAATGTA TTCTCCTTTC CTATACCGCT CTCCCAACAA AAAAACAACT 1860 

AGTTAGTTCT ACTAATTAGA AACTTGCTGT ACTTTTTCTT TTCTTTTAGG GGTCAAGGAC 1920 

CCTCTTTATA GCTACCATTT GCCTACAATA AATTATTGCA GCAGTTTGCA ATACTAAAAT 1980 

ATTTTTTATA GACTTTATAT TTTTCCTTTT GATAAAGGGA TGCTGCATAG TAGAGTTQGT 2040 

GTAATTAAAC TATCTCAGCC GTTTCCCTGC TTTCCCTTCT GCTCCATATG CCTCATTGTC 2100 


CTTCCAGGGA GCTCTTTTAA TCTTAAAGTT CTACATTTCA TGCTCTTAGT CAAATTCTGT 2160 

TACCTTTTTA ATAACTCTTC CCACTGCATA TTTCCATCTT GAAITGGTGG TTCTAAATTC 2220 

TGAAACTGTA GTTGAGATAC AGCTATTTAA TATTTCTGGG AGATGTGCAT CCCTCTTCTT 2280 

KGTGGTTGCC CAAQGTTGTT TTGCGTAACT GAGACTCCTT GATATGCTTC AGAGAATTTA 2340 

GGCAAACACT GGCCATGGCC GTGGGAGTAC TGGGAGTAAA ATAAAAATAT CGAGGTATAG 2400 

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10 






ACATAGAGCA CTTGAACXTTC 
CCAATCTAAC TAAGATTATT 
TGTTTTTCCC ACTTTCCAAT 
AATATTTTAT TTCTTAAAAG 
CATGGGAGAG ACAGGTAGTT 
TAATAAAAGG GATACTTTGC 
TTAAAAATAA ATATGT 




CTTTGTACCT GTTTGGGGAA AAAGTATAAT 
ATAGTCTGGT TGTTTGAAAT ACCATTTTTT 
GTACTCAAGA AAATTGAACA AATGTAATGG 
CCTTTTTTGC CTGTTGTAAT GTGCAGGACC 
ACCTGAATAT AGGTTGAAAA GGTTATGTAA 
TTTTCAAATC TTTGTTTTCT CTTATTCTAG 



2460 
2520 
2580 
2640 
2700 
2760 
2786 






(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
GGGTGCAGGA ATTCGGCACG AGAGAATGTT TGATTTTCTT TCCTATTTTA AGGATCTTCT 60 
CTCTTGTTGA TGTTGAAAAC TTACCTTAGT GAAGATGTGT TTCAACATGC TGTTGTCCTT 120 
TACCTGCATA ATCACAGCTA TGCATCTATT CAAAGTGATG ATCTGTGGGA TAGTTTTAAT 180 
GAGGTCACAA ACCAAACACT AGATGTAAAG AGAATGATGA AAACCTGGAC CCTGCAGAAA 240 
GGATTTCCTT TAGTGACTGT TCAAAAGAAA GGAAAGGAAC TTTTTATACA ACAAGAGAGA 300 
TTCTTTTTAA ATATGAAGCC TGAAATTCAG CCTTCAGATA CAAGGTACAT GCCCTCTTTC 360 
TTTTCATGCC ATCTCTTTTG CACTCTCAGG TGGAAATATT TTTAAGTGTT TTATAATCAT 420 
AAGTTCTTGT GAAACCTAAC AAGATTATCC CTTCCTAA 458 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 
AGGATGAAGA GGAAATTATC TCTTGGATTG CTCTCCAGGA AATCCTTCTC TATACTTTAA 60 
AAGCTCTTGT TCTTTTCTAG GARTCCAATG TGCTGATTGC TGCTAACAGT CAGGGTACAA 120 



TTAAGGTGCT AGAATTGGTA TGAAGGGTTA ACTCAAGTCA AATTGTACTT GATCCTGCTG 180 
AAATACATCT GCAGCTGACA ATGAGAGARG AAACAGAAAA TGTCATGTGA TGTCTCTCCC 240 
CAAAGTCATC ATGGGTTTTG GATTTGTTTT GAATATTTTT TCTTTTTTTC TTKTCCCTCC 300 
TTTATGAGCC TTTGGGACAT TGGGAATACC CAGCCAACTC TCCACCATCA ATGTAACTCC 360 
ATGGACATTG CTGCTCTTGG TGGTGTTATC TAATTTTTGT GATAGGGAAA CAAATTCTTT 420 
TGAATAAAAA TAAATAACWA AACAATAAAA GTTTATTGAG CCACAGTTGA GCTTGGAAAG 480 
TTTTTGTCAA ATGCNGCAAG AGATAACTCT TTTTANGAAG TAGCATATGT GAACTATAAT 540 
GTAACAGTGA ATAATTTGTA AAGTTCGTAT TTCCCAACCT CTTTGGGAAT T 591 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1286 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

TCTTTTTAAG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 60 

TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 120 

GAGATTACCT GGGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 180 


CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAGGTCCTT TCTGAAAGAT CTGCTACTGA 240 



AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACTCA 300 

CCTTCCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 360 

TCCACCACCG GGTTrTCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 420 

AGAAAGTGGA CATTCCTCTG GTTATGATAG TSGTTCTGCA CGTGCATTTC CATATGGCAA 480 


TGCGATGAAG AACGATACAG ATACAGGGAA TATGCAGAAA GAGGTTATGA GCGTCACAGA 540 

GCAAGTCGAG AAAANGAAGA AOGACATAGA GAAAGACGAC ACAGGGAGAA AGAGGAAACC 600 

AGACATAAGT CTTCTCGAAG TAATAGTAGA CGTCGCCATG AAAGTGAAGA AGGAGATAGT 660 

CACAGGAGAC ACAAACACAA AAAATCTAAA AGAAGCAAAG AAGGAAAAGA AGCGGGCAGT 720 

GAGCCTGCCC CTGAACAGGA GAGCACCGAA GCTACACCTG CAGAATAGGC ATGGrTTTTGG 780 


CCTTTTGTGT ATATTAGTAC CAGAAGTAGA TACTATAAAT CTTGTTATTT TTCTGGATAA 840 

TGTTTAAGAA ATTTACCTTA AATCTTGTTC TGTTTGTTAG TATGAAAAGT TAACTTTTTT 900 

TCCAAAATAA AAGAGTGAAT TTTTCATGTT AAGTTAAAAA TCTTTGTCTT GTACTATTTC 960 
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AAAAATAAAA AGACAGCAAT GACTTTATAT CCAAGAAAGG AATGTGAATG AGTCACTTAA 1020 

CAGGGAATCT AAAGAGCTGT GTTAGCTGTG TACATACACA GATTATCTGA GAAAAGGTCA 1080 

AGGGTTCCAC TTGGGCCACA GTTTrrTTGT TAATCAAACA CCACTCTCTT AAGRGGCTGC 1140 

ATCACAAARG GCAACCAARG GGCCCCTCTT ARGGCTTTGA GGATTAAAAC TAGTCTTTAT 1200 

CCATTACTGC TGTGGACACT CITGGCTTRG TATWTTTAGG GGGGNTCCTT ACCTTTTTTr 1260 

GGTTTTCCNC ACCTTTTTGG TTGGGC 1286 






(2) INFORMATION FOR SEQ ID NO: 238: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

ATGGCAGCGC AGAAGGACCA GCAGAAAGAT GCCGAGGCGG AAGGGCTGAG OGGCACGACC 60 

CTGCTGCCGA AGCTGATTCC CTCCGGTGCA GGCCGGGAGT GGCTGGAGCG GCGCCGCGCG 120 


ACCATCCGGC CCTGGAGCAC CTTCGTGGAC CAGCAGCGCT TCTCACGGCC CCGCAACCTG 180 

GGAGAGCTGT GCCAGCGCCT CGTACGCAAC GTGGAGTACT ACCAGAGCAA CTATGTGTTC 240 

GTGTTCCTGG GCCTCATCCT GTACTGTGTG GTGACGTCCC CTATGTTGCT GGTGGCTCTG 300 

-GeTGTCTTTT-TCGGeGCCTG-TTAACATTCT-^rrATCTGCGC -^A ~ " — y60~ 

CCTCTTTGGC CGAAAGGTGA GCCCAGCGCA TCATATGCTC TGGCTGGAGG CATCTCCTTC 420 


CCCTTCTTCT GGCTGGCTGG TGCGGGCTCG GCCGTCTTCT GGGTGCTGGG AGCCACCCTG 480 

GTGGTCATCG GCTCCCACGC TGCCTTCCAC CAGATTGAGG CTGTGGACGG QGAGGAGCTG 540 

CAGATGGAAC CCGTGTGAGG TGTCTTCTGG GACCTGCCGG CCTCCCGGGC CAGCTGCCCC 600 

ACCCCTGCCC ATGCCTGTCC TGCACGGTCT GCTGCTCGGG CCCACAGCGC CGTCCCATCA 660 

CAAGCCCGGG GAGGGATCCC GCCTTTGAAA ATAAAGCTGT TATGGGTGTC ATTCAAAAAA 720 

AAAAAAAAAA AAAA 734 



(2) INFORMATION FOR SEQ ID NO: 239: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 
(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 


CGGGGTCTTC AGGGTACCGG GCTGGTTACA GCAGCTCTAC CCCTCACGAC GCARACATGG 60 

CAGCQCAGAA GGACCAGCAG AAAGATGCCG AGGCGGAAGG GCTGAGCGGC ACGACCCTQC 120 

TGCCGAAGCT GATTCCCTCC GGTGCAGGCC GGGAGTGGCT GGAGCGGCGC CGCGCGACCA 180 

TCCGGCCCrc GAGCACCTTC GTQGACCAGC AGCGCTTCTC ACGGCCCCGC AACCTGGGAG 240 

AGCTGTGCCA GCGCCTCGTA CGCAACGTGG AGTACTACCA GAGCAACTAT GTGITCGTGT 300 


TCCTGGGCCT CATCCTGTAC TGTGTGGTGA CGTCCCCTAT GTTGCTGGTG GCTCTGGCTG 360 

TCTTTTTCGG CGCCTGTTAC ATTCTCTATC TGCGCACCTT GGAGTCCAAG CTTGTGCTCT 420 

TTGGCCGAGA GGTGAGCCCA GCGCATCAGT ATGCTCTGGC TGGAGGCATC TCCTTCCCCT 480 

TCTTCTGGCT GGCTGGTGCG GGCTCGGCCG TCTTCTGGGT GCTGGGAGCC ACCCTGGTGG 540 

TCATCGGCTC CCACGCTGCC TTCCACCAGA TTGAGGCTGT GGACGGGGAG GAGCTGCAGA 600 


TGGAACCCGT GTGAGGTGTC TTCTGGGACC TGCCGGCCTC CCGGGCCAGC TGCCCCACCC 660 

CTGCCCATGC CTGTCCTGCA CGGCTCTGCT GCTCGGGCCC ACAGCGCCGT CCCATCACAA 720 

GCCCGGGGAG GGATCCCGCC TTTGAAAATA AAGCTGTTAT GGGTGTCATT CAGGAAAAAA 780 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 809 






(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

TCGACCCACG CGTCCGGCAA CATGGCGGCT GCCGTGGTGC AGCGCCCGGG CTGAGCGACA 60 

GCAAGTGCAG CGGGCTCCTA CCCCGGGTGA GGGGTGGCCT CCGCGTGGGA TCGTGCCCTC 120 


TTCAGCCCGC TCCTGTCCCC GACATCACGT GTATTCCGCA CGTCCCCTCC GCGCTGTGTG 180 

TCTACTGAGA CGGGGAGGCG TGACAGQGCC CGGGTCCCTT CTCAGTGGTG CTCTGTGCTT 240 

CAGGGCAAGC TCCCCGTCTC CGGGCGCACT TCCCTCGCCT GTGTTCGGTC CATCCTCCTT 300 

TCTCCAGCCT CCTCCCCTCG CAGGCGGATG AMCCGGACGA CGGGCCAGTG CCTGGCACCC 360 

CGGGGTTGCC ARGGTCCAMG GGGAACCCGA AGTCCGAGGA GCCCGARGTC CCGAACCAGG 420 




ARGGGCTGCA GCGCATCAMC GGCCTGTCTC CCGGCCGTTC GGCTCTCATA GTGGCGGTGC 480 

TGTGCTACAT CAATCTCCTG AACTACATGG ACCGCTTCAC CGTGGCTGGC GTCCTTCCCX5 540 

ACATCGAGCA GTTCTTCAAC ATCGGGGACA GTAGCTCTGG GCTCATCCAG ACCGTGTTCA 600 

TCTCCAGTTA CATGGTGTTG GCACXrTGTGT TTGGCTACCT GGGTGACAGG TACAATCGGA 660 

AGTATCTCAT GTGCGGGGGC AOTGCCTTCT GGTCCCTGGT GACACTGGGG TCATCCTrCA 720 

TCCCCGGAGA GCATTTCTGG CTGCTCCTCC TGACCCGGGG CCTGGTGGGG GTCGGGGAGG 780 

CCAGTTATTC CACCATCGCG CCCACTCTCA TTGCCGACCT CTTTGTGGCC GACCAGCGGA 840 

CCGGATGCTC AGCATCTTCT ACTTTGCCAT TCCGGTGGGC AGTGGTCTGG GCTACATTGC 900 

AGGCTCCAAA GTGAAGGATA TGGCTGGAGA CTGGCACTGG GCTCTGAGGG TOACACCGGG 960 

TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT CCTGGTAGTG CGGGAGCCGC CAAGGGGAGC 1020 

CGTGGAGCGC CACTCAGATT TGCCACCCCT GAACCCCACC TCGTGGTGGG CAGATCTGAG 1080 

GGCTCTGGCA AGAAATCCTA GTTTCGTCCT GTCTTCCCTG GGCITCACTG CTGTGGCCTT 1140 

TGTCACGGGC TCCCTGGCTC TGTGGGCTCC GGCATTCCTG CTGCX5TTCCC GCGTGGTCCT 1200 

TGGGGAGACC CCACCCTGCC TTCCCGGAGA CTCCTGCTCT TCCTCTGACA GTCTCATCTT 1260 

TGGACTCATC ACCTGCCTGA CCGGAGTCCT GGGTGTGGGC CTGGGTGTGG AGATCAGCCG 1320 

CXX3GCTCCGC CACTCCAACC CCCGGGCTGA TCCCCTGGTC TGTGCCACTG GCCTCCTGGG 1380 

CTCTGCACCC TTCCTCTTCC TGTCCCTTGC CTGCGCCCGT GGTAGCATCG TGGCCACTTA 1440 

TATTTTMATC TTCATTGGAG AGACCCTCCT GTCCATGAAC TGGGCCATCG TGGCCGACAT 1500 

TCTGCTGTAC GTGGTGATCC CTACCCGACG CTCCACCGCC GAGGCCTTCC AGATCGTGCT 1560 

GTCCCACXTTG CTGGGTGATG CTGGGAGCCC CTACCTCATT GGCCTGATCT CTGACCGCCT 1620 

GCGCCGGAAC TGGCCCCCCT CXrrTCTTGTC CGAGTTCCGG GCTCTGCAGT TCTCGCTCAT 1680 

GCTCTGCGCG TTTGTTGGGG CACTGGGCX5G CXSCACTTTCC TGGGCACCGC CATCTTCATT 1740 

GAGGCCGACC GCCGGCX3GGC ACAGCTGCAC GTGCAGGGCC TGCTGCACGA AGCAGGGTCC 1800 

ACAGACGACX: GGATTGTGGT GCCXTCAGCGG GGCCGCTCCA CCCGCGTGCC CGTGGCCAGT 1860 

GTGCTCATCT GAGARGCTGC CGCTCACCTA CCTGCACATC TGCCACAGCT GGCCCTGGGC 1920 

CCACCCCACG AAGGGCCTGG GCCTAACCXTC TTGGCCTGGC CCAGCTTCCA GAGGGACCCT 1980 

GGGCXX3TGTG CCAGCTCCCA GACACTACWT GGGTAGCTCA GGGGAGGAGG TGGGGGTCCA 2040 

GGAGGGGGAT CCCTCTCCAC AGGGGCAGCC CCAAGGGCTC GGTGCTATTT GTAACGGAAT 2100 

AAAATTTGTA GCCAGACCCC AGGTGCCTGC TCTCGTCTTT CTCTGGGTGG CCTCTGATCT 2160 

TGCACCCCGT CTTCACCCCA GGGCTCCTGA AGACTGTGGG T 2201 
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(2) INFORMATION FOR SEQ ID NO: 241: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

GTCCTTCCCG ACATCGAGCA GTTCTTCAAC ATCGGGGACA GTAGCTCTGG GCTCATCCAG 60 


ACCGTGTTCA TCTCCAGTTA CATGGTGTTG GCACCTC5TGT TTGGCTACCT GGGTGACAGG 120 

TACAATCGGA AGTATCTCAT GTGCGGGGGC ATTGCCTTCT GGTCCCTGGT GACACTGGQG 180 

TCATSCTTCA TCCCCGGAGA GCATTTCTGG CTGCTCCTCC TGACCCGGGG CCTGGTGGGG 240 

GTCGGGGAGG CCAGTTATTC CACCATCGCG CCCACTCTCA TTGCCGACCT CTTTGTGGCC 300 

GACCAGCGGA SCGGATGCTC AGCATCTTCT ACTTTGCCAT TCCGGTGGGC AGTGGTCTGG 360 


GCTACATTGC AGGCTCCAAA GTGAAGGATA TGGCTGGAGA CTGGCACTGG GCTCTGAGGG 420 

TGACACCGGG TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT CCTGGTAGTG CGGGAGCCGC 480 

CAAGGGGAGC CGTGGAGCGC CACTCAGATT TGCCACCCCT GAACCCCACC TCGTGGTGGG 540 

CAGATYTGAG GGCTCTQGCA AGAAATCCTA GTTTCGTCCT GTCTTCCCTG GGCTTCACTG 600 

CTGTGGCCTT TGTCACGGGC TCCCTGGCTC TGTGGGCTCC GGCATTCCTG CTGCGTTCCC 660 


GCGTGGTCCT TGGGGAGACC CCACCCTGCC TTCCCGGAGA CTCCTGCTCT TCCTCTGACA 720 



GTCTCATCTT TGGACTCATC ACCTGCCTGA CCGGAGTCCT GGGTGTGGGC CTGGGTGTGG 780 

AGATCAGCCG CCGGYTCCGC CACTCCAACC CCCGGGCTGA TCCCCTGGTC TGTGCCACTG 840 

GCCTCCTGGG CTCTGCACCC TTCCTCTTCC TGTCCCTTGC CTGCGCCCGT QGTAGCATCG 900 

TQGCCACTTA TATTTTCATC TTCATTGGAG AGACCCTCCT GTCCATGAAC TGGGCCATCG 960 


TGGCCGACAT TCTGCTGTAC GTQGTGATCC CTACCCGACG CTCCACCGCC GAGGCCTTCC 1020 

AGATCGTGCT GTCCCACCTG CTGGGTGATG CTGGGAGCCC CTACCTCATT GGCCTGATCT 1080 

CTGACCGCCT GCGCCGGAAC TGGCCCCCCT CCTTCTTGTC CGAGTTCCGG GCTCTGCAGT 1140 

TCTCGCTCAT GCTCTGCGCG TTTCrrTGGGG CACTGGGCGG CGCACTTTCC TGGGCACCGN 1200 

CATCTTCATT GAGQCCGACC GCCGGCGGGC ACAGCTGCAC GTGCAGGGCC TGCTGCACGA 1260 


AGCAGGGTCC ACAGACGACC GGATTGTGGT GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC 1320 

CGTGGCCAGT GTGCTCATCT GAGAGGCTGC CGCTCACCTA CCTGCACATC TGCCACAGCT 1380 

KGCCCTGGGC CCACCCCACG AAGGGCCTGG GCCTAACCCC TTGGCCTGGC CCAGCTTCCA 1440 
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GAGGGACCCT GGGCCX3TGTG CCAGCTCCCA GACACTACMT GGGTAGCTCA GGGGAGGAGG 1500 

TGGGGGTCCA GGAGGGGGAT CCCTCTCCAC AGGGGNCACC CCAAGGGCTC GGTGCTATTT 1560 


GTAACGGAAT AAAATTTCTA GCCAGACXTCC AGGTGCCTGC TCTCGTCTTT CTCTGGGTGG 1620 

CCTCTGATCT TQCACCCCX3T CTTCACCCCA GGGCTCCTGA A 1661 




(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
NGACAGAAAA GCAGAAGATG AGACTCTGTT CATTCACTTT TCCTAGGCCC ATCCTGTGGT 60 
CATCTTTCCC CCTCCCATCA TACCTCCTCC TTCCTGGAGC CTCTGCCGGC TTGGCTGTAA 120 
TGGTGGCACT TACCTGGATA TTTCAGTGGG AGGATGAAAG GCGAGACTCA CCCTACGCGG 180 
TGGGACAGAT GGGGAGAGGA AAAAGGCAGA GATNGCCAGG AGAGGGGTGC AGGACAAACC 240 


AGAGAGGTTG GGTCAGGGGA AAAGTGTNGG GAGAAAGTGG GGTGCAGGCC CTGCAGGCCG 300 
GTTTAGCCAG CAGCTGCGGC CTCCCCGGGC CCTTGGCATC CAACTTCGCA GACAGGGTAC 360 
CAGCCTCCTG GTGTGTATCA TAGGATTTGT TCACATAGTG TTATGCATGA TCTTCGTAAG 420 
- " GWA;5CSffiGC"CGTO CACCATGAC^ " "480 

ATATATATGT ATGTAAATTA TAGCACTGAG GGCCCTGCTG CCCTGCTGGA CCAAGCAAAA 540 


CTAAGCCTTT TGGTTTGGGT ATTATGTTTC GTTTTGTTAT TTGTTTGTTT TTGTGGCTTG 600 
TCTTATGTCG TGATAGCACA AGTGCCAGTC GGATTGCTCT GTATTACAGA ATAGTGTTTT 660 
TAATTCATCA ATGTTCTAGT TAATGTCTAC CTCAGCACCT CCTCTTAGCC TAATTTTAGG 720 
AGGTTGCCCA ATTTTGTTTC TTCAATITTA CTGGTTACTT TTTTGTACAA ATCAATCTCT 780 
TTCTCTCTTT CTCTCCTCCC CACCTCTCAC CCTTGCCCTC TCCATCTCCC TCTCCCGCCC 840 


TCCCCTCCTC CCTCTGGCTC CCCGTCTCAT TTCTGTCCAC TCCATTCTCT CTCCCTCTCT 900 
CCTGCCTCCT GCTGCCCCCT CCCCAGCCCA CTTSCCCGAG TTGTGCTTGC CGCTCCTTAT 960 
CTGTTCTAGT TCCGAAGCAG TTTCACTCGA AGTTGTGCAG TCCTGGTTGC AGCTTTCCGC 1020 
ATCTGCCTTC GTTTCGTGTA GATTGACGCG TTTCTTTGTA ATTTCAGTGT TTCTGACAAG 1080 
ATTTAAAAAA AAAAAAAGGA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC 1140 

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(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 
AACCCACGGC TGCTGCGGCA GGGCGTGGAG GGCAGAGGGC CGCGGAGGCG CAGTTGCAAA 60 
CATGGCTCAG AGCAGAGACG GCGGAAACCC GTTCGCCGAG CCCAGCGAGC TTGACAACCC 120 
CTTTCAGCCA CCACCAGCCT ATGAGCCTCC AGCCCCTGCC CCATTGCCTC CACCCTCAGC 180 
TCCCTCCTTG CAGCCCTCGA GAAAGCTCAG CCCCACAGAA CCTAAGAACT ATGGCTCATA 240 
CAGCACTCAG GCCTCAGCTG CAGCAGCCAC AGCTGAGCTG CTGAAGAAAC AGGAGGAGCT 300 
CAACCGGAAG GCAGAGGAGT TGGACCGAAG GAGNCGAGAG CTGCAGCATG CTGCCCTGGG 360 
RGGCACAGCT ACTCGACAGA ACAATTGGCC CCCTCTACCT TCTTTTTGTC CAGTTCAGCC 420 
CTGCTTTTTC CAGGACATCT CCATGGAGAT CCCCCAAGAA TTTCAGAAGA CTGTATCCAC 480 
CATGTACTAC CTCTGGATGT GCAGCACGST GGCTCTTCTC CTGAACTTCC TCGCCTGCCT 540 
GGCCAGCTTC TGTGTGGAAA CCAACAATGG CGCAGGCTTT GGGCTTTCTA TCCTCTGGGT 600 
CCTCCTTTTC ACTCCCTGCT CCTTTGTCTG CTGGTACCGC CCCATGTATA AGGCTTTCCG 660 
GAGTGACAGT TCATTCAATT TCTTCGTTTT CTTCTTCATT TTCTTCGTCC AGGATGTGCT 720 
CTTTGTCCTC CAGGCCATTG GTATCCCAGG TTGGGGATTC AGTGGCTGGA TCTCTGCTCT 780 
GGTGGTGCCG AAGGCAACAC AGCAGTATCC GTGCTCATGC TGCTQGTCGC CCTGCTCTTC 840 
ACTGGCATTG CTGTGCTAGG AATTGTCATG CTGAAACGGA TCCACTCCTT ATACCGCCGC 900 
ACAGGTGCCA GCTTTCAGAA GGCCCAGCAA GAATTTGCTG CTGGTGTCTT CTCCAACCCT 960 
GCGGTGCGAA CCGCARCTTG CCAATGCAGC CGCTGGGGCT GCTGAAAATG CCTTCCGGGC 1020 
CCCGTGACCC CTGACTGGGA TGCCCTGGCC CTGCTACTTG AGGGAGCTGA CTTAGCTCCC 1080 
GTCCCTAAGG TCTCTGGGAC TTGGAGAGAC ATCACTAACT GATGGCTCCT CCGTAGTGCT 1140 
CCCAATCCTA TGGCCATGAC TGCTGAACCT GACAGGCGTG TGGGGAGTTC ACTGTGACCT 1200 
AGTCCCCCCA TCAGGCCACA CTGCTGCCAC CTCTCACACG CCCCAACCCA GCTTCCCTCT 1260 
GCTGTGCCAC GGCTGTTGCT TCGGTTATTT AAATAAAAAG AAAGTGGAAC TGGAAAAAAA 1320 
AAAAAAAAAA AAAAAAAAAG GOOGOTCCNC 1350 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TCCCAGAGGC CGGGGGGTTC CAGCTCTGCC TGTAGCAGAG CCCTGAGGAG GAGGAGGAAG 60 

AGGATGTGCT GAAATACGTC CGGGAGATCT TTTTCAGCTA GGGCATAAAC TGTGCACTGA 120 

ACTGTCTGCC GAGAGCAGCT GGAGGACAGC TGAGCTTCCA CTGGTGCTGC TGGGCCGMCC 180 


GCCTGTGGGA ATGGGGCTCT CTGTGCTCCT ACCTTTGTGC CTTCTTGGGC CTGGCAGATT 240 

CACCTCAGGC CAGAAGCCCC TGGACACTCC GGGCCTTGGG GTGCCGTTCT GAGTGTGCGG 300 

AAGGCAGGAC TCAAAATGAG ATCCCATTTG ACTCCCTCTG TATGTACTGT GCCCTCTCCT 360 

GGCTCTTGAG GCTCTGGAGT CCCAATTGTC TGTGTTAGTC AGTGACCAGG TTCCAGGGAA 420 

AATRATGTCA TGTGGTGGTC CAACTTACTG GAACCAAAGA GACAGTT^CrT TGCAAAGAAA 480 


AGGATCACTG CCAGGTGCAC TGGAATTGCT ACAGTTTAGT CCGCATGATC TCTCCTGAAG 540 

GAGGAAGCCT GTTTCAAAAA TAGTTTCCAT CATGAGTCTA TGAATGAGCT CCCACCTCTC 600 

CAGCCAGCCT AGAAAGCAAA CGAGCTGCCC ACAGTTCTCT GCCCTGTCTG GGAGGTTGAG 660 

tXXTACAGTGT -ATAGAXrrGGT^AAGCC75GRCKnGGCCTCCTC 7Z0" 

TCACCTGTAC CTTGGTCCCC GGGCAGCTAG CTATAAAGCA AGAGGGACAG GAGCCCAGAA 780 


GAGACACTGA GGACAAGAGA TCACACCAGA GTACATGTCT CTGCCTCTGT TTTCAGTGTG 840 

GCTTTGGACA GGAATATATG AATAAATCAC TGCCATACAG GTTTTCCAAT ACACAAGTGC 900 

TAGAAAATAC ACACAATTCC CCAATGCGTA AGTTGTGCTA ATGTCTTTCC AAGTTCTGGG 960 

TTGGGAAGTG GAGGGTGGCA GCGTTTGTTT GTGCGCAACC GTCCAGTCCT GTTCACAGCG 1020 

AGGATTTGGA GTCCTCCAGG GTCTCATCAT GGGAGTGATT TGTCAGCGGA CGCCTCTGCC 1080 


CTGTCTGGCT TCAGGTCCAG GGAAGCTTTG AAGCAGTCAA GCCTTGTCTT TGTACCCCAT 1140 

GTGTCCTGTC TTTGTTGAGT CACTCAGAGA TCACTCCTGG ACCTCTGGGG TTGGAGTTCC 1200 

AGTGATGGCT TATGGCGGCC CACTCACTAT GGTGGGCTGA GTGGAAGCTC CTTAACCATG 1260 

TCCCCAGAGA CACTGAGGTG CTCGCTCTTT TAATGTCCTC GTTTGTTGCC GTAAGTTCTT 1320 

TGCTAGGTTT CATTTTGGCA TTTGGCAAAT CAGCCTGGAA GTCTGGCCCC ATGACAGCAA 1380 

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TCACTCCCTC CCCACCCTCC TGAAGCTAGA GGAAGATTTG CTCAGATCCA TTAATTAAAG 1440 

CAGGAATTGG TGTGACAATG AGCTGCATGG TTTAGGGAGT CTTTQGGAGC CTTGGAAGTC 1500 

CTGAAGGACA AACAATCTTG TACTAAGAA 1529 



(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1537 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

GTGCGAGGTC CCCGCCAGCC CCCAGCGGCC TTCCCGGCCC GGGGCGCTCC CAGAGCAAAC 60 

GAGGCCCCTG AGAGCTCCAC CTAGTTCACA GGATAAAATC CCACAGCAGA ACTCGGAGTC 120 

AGCAATQGCT AAGCCCCAGG TGGTTGTAGC TCCTGTATTA ATGTCTAAGC TGTCTGTGAA 180 


TGCCCCTGAA TTTTACCCTT CAGGTTATTC TTCCAGTTAC ACAGAATCCT ATGAGGATGG 240 

TTGTGAGGAT TATCCTACTC TATCAGAATA TGTTCAGGAT TTTTTGAATC ATCTTACAGA 300 

GCAGCCTGGC AGTTTTGAAA CTGAAAOTGA ACAGTTTGCA GAGACCCTGA ATGGTTGTGT 360 

TACAACAGAT GATGCTTTGC AAGAACTTGT GGAACTCATC TATCAACAGG CCACATCTAT 420 

CCCAAATTTC TCTTATATGG GAGCTCGCCT GTGTAATTAC CTGTCCCATC ATCTGACAAT 480 


TAGCCCACAG AGTGGCAACT TCCGCCAATT GCTACTTCAA AGATGTCGGA CTGAATATGA 540 



AGTTAAAGAT CAAGCTGCAA AAGGGGATGA AGTTACTCGA AAACGATTTC ATGCATTTGT 600 

ACTCTTTCTG GGAGAACTTT ATCTTAACCT GGAGATCAAG GGAACAAATG GACAGGTTAC 660 

AAGAGCAGAT ATTCTTCAGG TTGGTCTTCG AGAATTGCTG AATGCCCTGT TTTCTAATCC 720 

TATGGATGAC AATTTAATTT GTGCAGTAAA ATTGTTAAAG TTGACAGGAT CAGTTTTGGA 780 


AGATGCTTGG AAGGAAAAAG GAAAGATGGA TATGGAAGAA ATTATTCAGA GAATTGAAAA 840 

CGTTGTCCTA GATGCAAACT GCAGTAGAGA TGTAAAACAG ATGCTCTTGA AGCTTGTAGA 900 

ACTCCGGTCA AGTAACTGGG GCAGAGTCCA TGCAACTTCA ACATATAGAG AAGCAACACC 960 

AGAAAATGAT CCTAACTACT TTATGAATGA ACCAACATTT TATACATCTG ATGGTGTTCC 1020 

TTTCACTGCA GCTGATCCAG ATTACCAAGA GAAATACCAA GAATTACTTG AAAGAGAGGA 1080 


CTTTTTTCCA GATTATGAAG AAAATGGAAC AGATTTATCC GGGGCTGGTG ATCCATACTT 1140 

GGATGATATT GATGATGAGA TGGACCCAGA GATAGAAGAA GCTTATGAAA AGTTTTGTTT 1200 

GGAATCAGAG CGTAAGCGAA AACAGTAAAG TTAAATTTCA GCATATCAGT TTTATAAAGC 1260 
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AGTTTAGGTA TGGTGATTTA GCAGAACACA AGAGAGCAAG AAAATGTGTC ACATCTATAC 1320 

CAAATTRAGG ATGTTGAGTT ATGTTACTAA TGTATGCAAC TTTAATTTTG TTTAACACTA 1380 

TCTGCCAAAA TAAACTTTAT TCCCTATAAC TTAAAATGTG TATATATATA TAATAGTTTA 1440 

TTATGTACAG TTAATTCTAC TGTTTTGGCT GCAATAAAAT CX3ATTTTGAA ATAAAWRAAA 1500 

AAAAAAAAAA AAGGGNGGCC GCTCTAGAGG ANCCAAG 1537 






(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
TGCAGGATTT GGCCAGGACC CSCCGCGGTG GCGGTTGCTA TCGCTTCGCA GAACCTACTC 60 
AGGCAGCCAG CTGAGAAGAG TTGAGGGAAA GTGCTGCTGC TGGGTCTGCA GACGCGATGG 120 
ATAACGTGCA GCCGAAAATA AAACATCGCC CCTTCTGCTT CAGTGTGAAA GGCCACGTGA 180 
AGATGCTGCG GCTGGATATT ATCAACTCAC TGGTAACAAC AGTATTCATG CTCATCGTAT 240 
CTGTGTTGGC ACTGATACCA GAAACCACAA CATTGACAGT TGGTGGAGGG GTGTTTGCAC 300 
TTGTGACAGC AGTATGCTGT CTTGCCGACG GGGCCCTTAT TTACCGGAAG CTTCTGTTCA 360 
ATCCCAGCGG TCCTTACCAG AAAAAGCCTG TGCATGAAAA AAAAGAAGTT TTGTAATTTT 420 
ATATTACTTT TTAGTTTGAT ACTAAGTATT AAACATATTT CTGKATTATT CCAAAAAAAA 480 
AAAAAAAAAA AAAAAAAATT TGGTGG 506 
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(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
GTCTTTCTTT TNCTGTTTTG AGTTGGTGAG TGAGTGAATA GGGTAACATG GGCCTTCAGG 60 
ATGACCCCTT GGAACTGTGC CGAGTTCCTT AAATCTCAGC TGGGATCCTG GACCTGGGAG 120 
GGGCCAGCTC TGGAAAAACC TGGGAGTTGA TGCCGGAGGY TGGGAAGAAC 180 
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TCTGCTCGAG GGCAGGGTGC CCTGGAACAC TGGTAGTTCT GGGGCTGGGA GGGAGAGGGG 240 

CTCCGGCTTT CTCTGAAATG AACACTGCTC TTCAGCAGTT CAAGTACTTG TTCTCAAAAC 300 


ATTTTCTAAT TGATTGCTAG GTTTTCATAA GCATrGTTTC TTTAAGGCAT GGAAAGQGAA 360 

GAATGCTCAA GCAAGTCATG TTTGTTTTCA GTGGC3ATGGG CCCGCGTOCT CACTGCTGGG 420 

GGCTTCCCCT TGCATGTGGC ACCTTTGTCC AGGGCXACCA GGCAGACTCT TCCCACCTTC 480 

TCCCACTGAA GCACCAAGGG GCTTGAACCG TAATTTGGCT AATCAGAGGC ATTTTTTTTG 540 

TCCTAGTATC TTTCACACTT GTCCAACCGT CTTATTTTTT TAAAAGTTCT GTTGCTTGTA 600 


TTAACACGAA ACTAGAGAGA AATAGTTTCT GAAGCCAGTT TATTGTGAAG ATCCCCAAGG 660 

GGAGGTTCGG TAGAGAAAAA TAGTAAGCTG GTTTAGAAAC TGACGAGGGC AAACAGCX^^; 720 

GACX^CATTGG AGAGGAATTT GCCAAAGATC TACCCTGAGA TAACGCCTGT CCAGTGTCrr 780 

CACCACGTGA ATAACCAGCG CTCCAAAGTG TTTTTCTGCT TTGAAAAAAA AAATTCCACA 840 

AGCTTTTAAA GGTGCATTTA AGAATCCATG TGACTTTAGA ATGGAACTGC CGGCCCTGGC 900 


AACTGTCACG TGTGCTAGAA GGTTCGATGC CTCTGGAATG CATGTGATAC TCATCTCCAT 960 

TTTGTTTCCT TGATTGCATT TTrGTTCTTT TAGCAGATCT GTCXXTGTGG GTGGTGTCTA 1020 

AGAAGTCGGA CACCTTGGTT TTTGTGTTAG ATTGAGCTGG GCAGCTGCAA TCAGCTTCTT 1080 

TATATGCAAA TTAGGCACGA CCTATCTGTG GTTCCCTGGT TGGTGGCT7A TGAAGTGAGG 1140 

GGAGGGAGGG ATGTCACCCC AAAAGTAGGC CCTCCCATTG GCTTTGGCCA GGCCftGACAC 1200 


TTCACATCGT TTACATGGTT CTGTGTAATT OTAAAGTTTA TGTGTATAAA GCGAAGCTGT 1260 

TTCTGTGAAA CTGTATATTT TGTAAATAAA TATATTGCTA CTTTGAGAWR AAAAAAAAAA 1320 

AAAAACTCGA GGGGGGCCCX3 GTACCCAA 1348 



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

GTGCCGAATC GGCAGAGCGG CACGAGCGGC CACGAGAGCA GGCGGAGTAA AGGGACTTGA 60 

GCGAGCCAGT TGCCGGATTA TTCTATTTCC CCTCCCTCTC TCCCGCCCCG TATCTCTTTT 120 

CACCCTTCTC CCACCCTCGC TCGCGTASCA TGGCGGAGCG TCGGCGGCCA CTCAGTCCCA 180 

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TTCCATCTCC TCGTCGTCCT TCGGAGCCGA GCCGTCCGCG CCCGGCGGCX; GCGGGAGCCC 240 

AGGAGCCTGC CCCGCCCTGG GGACGAAGAG CTGCAGCTCC TCCTGTGCGG TGCACGATCT 300 

GATTTTCTGG AGAGATGTGA AGAAGACTGG GTTTGTCTTT GGCACCACGC TC3ATCATGCT 360 

GCTTTCCXrTG GCAGCTTTCA GTGTCATCAG TGTGGTTTCT TACCTCATCC TOGOTCTTCT 420 

CTCTGTCACC ATCAGCTTCA GGATCTACAA GTCCGTCATC CAAGCTGTAC AGAAGTCAGA 480 

AGAAGGCCAT CCATTCAAAG CCTACCTGGA CGTAGACATT ACTCTGTCCT CAGAAGCTTT 540 

CCATAATTAC ATGAATGCTG CCATGGTGGA CATCAACAGG GCCCTGAAAC TCATTATTCG 600 

TCTCTTTCTG GTAGAAGATC TGGTTGACTC CTTGAAGCTG GCTGTCTTCA TCTGGCTGAT 660 

GACCTATGTT GGTGCTGTTT TTAACGGAAT CACCCTTCTA ATTCTTGCTG AACTGCTCAT 720 

TTTCAGTGTC CCGATTGTCT ATGAGAAGTA CAAGACCCAG ATTGATCACT ATGTTGGCAT 780 

CGCCCGAGAT CAGACCAAGT CAATTGTTGA AAAGATCCAA GCAAAACTCC CTGGAATCGC 840 

CAAAAAAAAG GCAGAATAAG TACATGGAAA CCAGAAATGC AACAGTTACT AAAACACX^VT 900 

TTAATAGTTA TAACGTCGTT ACTTGTACTA TGAAGGAAAA TACTCAGTGT CAGCTTGAGC 960 

CTGCATTCCA AGCTTTTTTT TTAATTTGGT GTTTTCTCCC ATCCTTTCCC TTTAACXXTTC 1020 

AGTATCAAGC ACAAAAATTG ATGGACTGAT AAAAGAACTA TCTTAGAACT CAGAAGAAGA 1080 

AAGAATCAAA TTCATAGGAT AAGTCAATAC CTTAATGGTG GTAGAGCCTT TACCTGTAGC 1140 

TTGAAAGGGG AAAGATTGGA GGTAAGAGAG AAAATGAAAG AACACCTCTG GGTCCTTCTG 1200 

TCCAGTTTTC AGCACTAGTC TTACTCAGCT ATCCATTATA GTTTTGCCCT TAAGAAGTCA 1260 
TGATTAACTT ATGAAAAAAT TATTTGGGGA ^y«5SAGTGTG OTACXTTTCCT TGC?rriTrrr 1320

TTGCAGCCCT CAAATCCTAT CTTCCTGCCC CACAATGTGA GCAGCTACCC CTGATACTCC 1380 

TTTTCTTTAA TGATTTAACT ATCAACTTGA TAAATAACTT ATAGGTGATA GTGATAATTC 1440 

CTGATTCCAA GAATGCCATC TGATAAAAAA GAATAGAAAT GGAAAGTGGG ACTGAGAGGG 1500 

AGTCAGCAGG CATGCTGCGG TGGCGGTCAC TCCCTCTGCC ACTATCCCCA GGGAAGGAAA 1560 

RGCTCCGCCA TTTGGGAAAG TGGTTTCTAC GTCACTGGAC ACCGGTTCTG AGCATTAGTT 1620 

TGAGAACTCG TTCCCGAATG TGCTTTCCTC CCTCTCCCCT GCCCACCTCA AGTTTAATAA 1680 

ATAAGGTTGT ACTTTTCTTA CTATAAAATA AAAAAAAAAA AACTCGAGOG QGGCCCGGTA 1740 

CXCAAATCGC CX3GATATGAT CGTAAA 1766 



(2) INFORMATION FOR SEQ ID NO: 249: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2664 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

AGTGTCCTCG GAGCAGGCGG AGTAAAGGGA CTTGAGOGAG CX:AGTTGCCX3 GATTATTCTA 60 

10 TTTCCXrCTCC CTCTCTCCCG CCCCGTATCT CTTTTCACCC TTCTCCCACC CTCGCTCGCX3 120 

TASCATGGCG GAGCGTCGGC GGCCACTCAG TCCCATTCCA TCTCCTCGTC GTCCTTCGGA 180 

GCXTGAGCCGT CCGCGCCCGG CGGCGGCGGG AGCCCAGGAG CCTGCCCCGC CCTGGGGACG 240 


AAGAGCTGCA GCTCCTCCTG TGCGGTGCAC GATCTGATTT TCTGGAGAGA TGTGAAGAAG 300 

ACTGGGrTTG TCTTTGGCAC CACGCTGATC ATGCTGCTTT CCCTGGCAGC TTTCAGTGTC 360 

20 ATCAGTGTGG TTTCTTACCT CATCCTGGCT CITCTCTCTG TCACCATCAG CTTCAGGATC 420 

TACAAGTCCG TCATCCAAGC TGTACAGAAG TCAGAAGAAG GCCATCCATT CAAAGCCTAC 480 

CTGGACGTAG ACATTACTCT GTCCTCAGAA GCTTTCCATA ATTACATGAA TGCTGCCATG 540 


GTGCACATCA ACAGGGCCCT GAAACTCATT ATTCGTCTCT TTCTGGTAGA AGATCTGGTT 600 

GACTCCTTGA AGCTGGCTGT CTTCATGTGG CTGATGACCT ATGTTGGTGC TGTTTTTAAC 660 

30 GGAATCACCC TTCTAATTCT TGCTGAACTG CTCATTTTCA GTGTCCCGAT TGTCTATGAG 720 

AAGTACAAGA CCCAGATTGA TCACTATGTT GGCATCGCCC GAGATCAGAC CAAGTCAATT 780 

GTTGAAAAGA TCCAAGCAAA ACTCCCTGGA ATCGCCAAAA AAAAGGCAGA ATAAGTACAT 840 


GGAAACCAGA AATGCAACAG TTACTAAAAC ACCATTTAAT AGTTATAACG TCGTTACTTG 900 

TACTATGAAG GAAAATACTC AGTGTCAGCT TGAGCCTGCA TTCCAAGCTT TTTTTTTAAT 960 

40 TTGGTGTTTT CTCCCATCCT TTCCCTTTAA CCCTCAGTAT CAAQCACAAA AATTGATGGA 1020 

CTGATAAAAG AACTATCTTA GAACTCAGAA GAAGAAAGAA TCAAATTCAT AGGATAAGTC 1080 

AATACCTTAA TGGTGGTAGA GCXTTTTACCT GTAGCTTGAA AGGGGAAAGA TTGGAGGTAA 1140 


GAGAGAAAAT GAAAGAACAC CTCTGGGTCC TTCTGTCCAG TTTTCAGCAC TAGTCTTACT 1200 

CAGCTATCCA TTATAGTTTT GCCCTTAAGA AGTCATGATT AACTTATGAA AAAATTATTT 1260 

50 GGGGACAGGA GTGTGATACC TTCCTTGGTT TITTTTTGCA GCCCTCAAAT CCTATCTTCC 1320 

TGCCCCACAA TGTGAGCAGC TACCCCTGAT ACTCCmTC TTTAATGATT TAACTATCAA 1380 

CTTGATAAAT AACTTATAGG TGATAGTGAT AATTCCTGAT TCCAAGAATG CCATCTGATA 1440 


AAAAAGAATA GAAATGGAAA GTGGGACTGA GAGGGAGTCA GCAGGCATGC TGCGGTGGCG 1500 

GTCACTCCCT CTGCCACTAT CCCCAGGGAA GGAAARGCTC CGCCATTTGG GAAAGTGGTT 1560 

60 TCTACGTCAC TGGACACCGG TTCTGAGCAT TAGTTTGAGA ACTCGTTCCC GAATGTGCTT 1620 
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TCCTCCCTCT CCXXTTGCCCA CCTCAAGTTT AATAAATAAG GTTGTACTTT TCTTACTATA 1680 



AAATAAATGT CTGTAACTGC TGTGCACTGC TGTAAACTTG TTAGAGAAAA AAATAACCTG 1740 


CATGTCGGCT CCTCAGTTAT TGAGTTnTG TGATCCTATC TCAGTCTGGG GGGGAACAIT 1800 



CTCAAGAGGT GAAATACAGA AAGCCTTTTT TTCTTGATCT TTTCCCGAGA TTCAAATCTC 1860 



10 CGATTCCCAT TTGGGGGCAA GTmTTTCT TCACCTTCAA TATGAGAATT CAGCGAACTT 1920 



GAAAGAAAAA TCATCTGTGA GTTCCTTCAG GTTCTCACTC ATAGTCATGA TCCTICAGAG 1980 



GGAATATGCA CTGGCGAGTT TAAACTTAAGG GCTATGATAT TTGATGGTCC CAAAGTACGG 2040 


CAGCTGCAAA AAGTAGTGGA AGGAAATTGT CTACX3TGTCT TGGAAAAATT AGTTAGGAAT 2100 



TTGGATGGGT AAAAGGTACC CTTGCCTTAC TCCATCTTAT TTTCTTAGCC CCCTTTGAGT 2160 



20 GTTTTAACTG GTTTCATGTC CTAGTAGGAA GTGCATTCTC CATCXTTCATC CTCTGCCCTC 2220 



CCAGGAAGTC AGTGATTGTC TTTTTGGGCT TCCCCTCCAA AGGACCTTCT GCAGTGGAAG 2280 



TGCCACATCC AGTTCTTTTC TTTTGTTGCT GCTGTGTrTA GATAATTGAA GAGATCTTTG 2340 


TGCCACACAG GATTTTTTTT TTTTTTAAGA AAAACCTATA GATGAAAAAT TACTAATGAA 2400 



ACTGTGTGTA CGTGTCTGTG CGTGCAACAT AAAAATACAG TAGCACCTAA GGAGCTTGAA 2460 



30 TCTTQGTTCC TGTAAAATTT CAAATTGATG TGGTATTAAT AAAAAAAAAA AAAACAMAAA 2520 



AAAAAAAATWV AAAAGGGCGG CCGCTCTAGA GGATCCAAGC TTACGTACGC GTGCATGCX5A 2580 



CGTCCATAGC TCTTTCTATA GGGGTCCCCC AAATTCCATT CANCGGGCCG TCGGTTTTAN 2640 


AAAGGTCGTG AI STTGG GGGAA ANCC 2664




(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 


CGTGGGAGTG AGGTACCAGA TTCAGCCCAT TTGGCCCCGA CGCCTCTKTT CTCGGAATCC 60 
GGGTGCTGCG GATTGAQGTC CCGGTTCCTA ACGGTGGGAT CGGTGTCCTC GGGATGAGAT 120 



55 TTGGCGTTTC CTCGGGGCTT TGGTGGGATC GGTGTCCTCA GGATGAGATT TAGGGTTTCC 180 



TCGGGGCTTT CGGGATCTTC ACCTAATATC CGGACTGCAA GATGGAGGAA QGCGGGAACC 240 



TAGGAGGCCT GATTAARATG GTCCATCTAC TGGTCTTGTC AGGTGCCTGG GGCATGCAAA 300 

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TGTGGGTGAC CTTCGTCTCA GGCTTCCTGC TTTTCCGAAG CCTTCCCCGA CATACCTTCG 360 

GACTAGTGCA GAGCAAACTC TTCCCCTTCT ACTTCCACAT CTCCATGGGC TGTGCCTTCA 420 

5 TCAACCTCTG CATCTTGGCT TCACAGCATG CTTGGGCTCA GCTCACATTC TGGGAGGCCA 480 

GCX:AGCTTTA CCTGCTCTTC CTGAGCCTTA CGCTCGCCAC TGTCAACX3CC CGCTGGCTGG 540 

AACC<XX3CAC CACAGCTGCC ATGTGGGCCC TGCAAACCGT GGAGAAGGAG CGAGGCCTGG 600 


GTGGGGAGGT ACCAGGCAGC CACCAGGGTC CCGATCCXTTA CCGCCAGCTG CX3AGAGAAGG 660 

ACCCCAAGTA CAGTGCTCTC CGCCAGAATT TCTTCCGCTA CCATGGGCTG TCCTCTCTTT 720 

15 GCAATCTGGG CTGCGTCCTG AGCAATGGGC TCTGTCTCGC TGGCCTTGCC CTGGAAATAA 780 

GGAGCCTCTA GCATGGGCCC TGCATGCTAA TAAATGCTTC TTCAGAAAAA AAAAAAAAAA 840 

AAACTCGAGG GGGGCCCGGT ACCCA 865 




(2) INFORMATION FOR SEQ ID NO: 251: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
TGGGGGGGGN AATGGGTGTC TGGCTCANGG ATTGCCNAAT CTGGAAATTC TCCATAACTT 60 
GCTAGCTTGT 'i'i' I ' m 'l Ti ''i ' TTTTTTTACA CCCCCCCGCC CCACCCCCGG ACTTGCACAA 120 



TGTTCAATGA TCTCAGCAGA GTTCTTCATG TGAAACGTTG ATCACCTTTG AAGCCTGCAT 180 

40 CATTCACATA TTTTTTCTTC TTCTTCCCCT TCAGTTCATG AACTGGTGTr CATTTTCTGT 240 

GTGTGTGTGT GTTTTATTTT GTTTGGATTT' TTTTTTTTAA TTTTACTTTT AGAGCTTGCT 300 

GTGTTGCCCA CCTTTTTTCC AACCTCCACC CTCACTCCTT CTCAACCCAT CTCTTCCGAG 360 


ATGAAAGAAA AAAAAAAGCA AAGTTTTTTT TTCTTCTCCT GAGTTCTTCA TGTGAGATTG 420 

AGCTTGCAAA GGAAAAAAAA ATGTGAAATG TTATAGACTT GCAGCGTGCC GAGTTCCATC 480 

50 GGGTnTTTT TTTAGCATTG TTATGCTAAA ATAGAGAAAA AAATGCTCAT GAACCTTCCA 540 

CAATCAAGCC TGCATCAACC TTCTGGGTGT GACTTGTGAG TTTTGGCCTT GTGATGCCAA 600 

ATCTGAGAGT TTAGTCTGCC ATTAAAAAAA CTCATTCTCA TCTCATGCAT TATTATGCTT 660 


GCTACTTTGT CTTAGCAACA ATGAACTATA ACTGTTTCAA AGACTTTATG GAAAAGAGAC 720 

ATTATATTAA TAAAAAAAAA AAGCCTGCAT GCTGGACATG TATGGTATAA TTATTTTTTC 780 

cTri ' i ' ri ' rrx ^ ccttttggct tggaaatgga cgttcgaaga cttatagcat ggcattcata 840
CTTTTGTTTT ATTGCCTCAT GACriTTTTTG AGTTTAGAAC AAAACAC3TGC AACCGTAGAG 900
CCTTCTTCCC ATGAAATTTT GCATCTGCTC CAAAACTGCT TTGAGTTACT CAGAACTTCA 960
ACCTCCCAAT GCACTGAAGG CATTCCTTGT GCAAAGATAC CAGAATGGGT TACACATTTA 1020
ACCTGGCAAA CATTGAAGAA CTCTTRATGT I ITCTT I TT A ATAAGAATGA CGCCCCACTT 1080
TGGGGACTAA AATTGTGCTA TTGCCGAGAA GCAGTCTAAA ATTTATTTTT TAAAAAGAGA 1140
AACTGCCCCA TTATTTTTGG TrrGTTTTAT ITTTATTTTA TATTTTTTGG CTTTTGGTCA 1200
TTGTCAAATG TGGAATGCTC TGGGTTTCTA GTATATAATT TAATTCTAGT TTTTATAATC 1260
TGTTAGCCCA GTTAAAATGT ATGCTACAGA TAAAGGAATG TTATAGATAA ATTTGAAAGA 1320
GTTAGGTCTG TTTAGCTGTA GATTTTTTAA ACGATTGATG CACTAAATTG TTTACTATTG 1380
TGATGTTAAG GGGGGTAGAG TTTGCAAGGG GACTGTTTAA AAAAAGTAGC TTATACAGCA 1440
TGTGCTTGCA ACTTAAATAT AAGTTGGGTA TGTGTAGTCT TTGCTATACC ACTGACTGTA 1500
TTGAAAACCA AAGTATTAAG AGGGGAAACG CCCCTGTTTA TATCTGTAGG GGTATTTTAC 1560
ATTCAAAAAT GTATGTTTTT TTTTCTTTTC AAAATTAAAG TATTTGGGAC TGAATTGCAC 1620
TAAGATATAA CCTGCAAGCA TATAATACAA AAAAAAATTG CAAAACTGTT TAGAACX3CTA 1680
ATAAAATTTA TGCAGTTATA AAAATGGCAT TACTGCACAG TTTTAAGATG ATGCAGATTT 1740
TTTTACAGTT GTATTGTGGT GCAGAACTGG ATTTTCTGTA ACTTAAAAAA AAATCCACAG 1800
TTTTAAAGGC AATAATCAGT AAATGTTATT TTCAGGGACT GACATCCTGT CTTTAAAAAG 1860
AAATGAAAAG TAAATCTTAC CACAATAAAT ATAAAAAAAT CTTGTCAGTT ACTTTTCTTT 1920
TACATATTTT GCTGTGCAAA ATTGTTTTAT ATCTTGAGTT ACTAACTAAC CACGCGTGTT 1980
GTTCCTATGT GCTTTTCTTT CATTTTCAAT TCTGGTTATA TCAAGAAAAG AATAATCTAC 2040
AATAATAAAC GGCATTTTTT TTTGAAAAAA AAAAAAAAAA AA 2082






(2) INFORMATION FOR SEQ ID NO: 252:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1482 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:
CAGGCAGGCT GGCCCCGGGG ACTTCTCTCT GGCCCTGCTC CCTCCGAGCG CTCCGCCGTT 60
GCCCGCCTGG CCCCTACGGA GTCCTTAGCC AGGATGGAGG CTGTTGTGAA CTTGTACCAA 120
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GAGGTGATGA AGCACGCAGA TCCCCGGATC CAGGGCTACC CTCTGATGGG GTCCCCCTTG 180 

CTAATGACCT CCATTCTCCT GACCTACGTG TACITCGTTC TCTCACTTGG GCCTCGCATC 240 

ATGGCTAATC GGAAGCCCTT CCAGCTCCGT GGCTTCATGA TTGTCTACAA CTTCTCACTG 300 

GTGGCACTCT CCCTCTACAT TGTCTATGAG TTCCTGATGT CGGGCTGGCT GAGCACCTAT 360 

ACXrrGGCXSCT GTGACCCTGT GGACTATTCC AACAGCCCTG AGGCACTTAG GATGGTTCGG 420 

GTGGCCTQGC TCTTCCTCTT CTCCAAGTTC ATTGAGCTGA TGGACACAGT GATCTTTATT 480 

CTCCGAAAGA AAGACGGGCA GGTGACCTTC CTACATGTCT TCCATCACTC TGTGCTTCCC 540 

TGGAGCTGGT GGTGGGGGGT AAAGATTGCC CCGGGAGGAA TGGGCTCTTT CCATGCCATG 600 

ATAAACTCTT CCGTGCATGT CATAATGTAC CTGTACTACG GATTATCTGC CTTTGGCCCT 660 

GTGGCACAAC CCTACCTTTG GTGGAAAAAG CACATGACAG CCAITCAGCT GATCCAGTTT 720 

GTCCTGGTCT CACTGCACAT CTCCCAGTAC TACTTTATGT CCAGCTGTAA CTACCAGTAC 780 

CCAGTCATTA TTCACCTCAT CTGGATGTAT GGCACCATCT TCTTCATGCT GTTCTCCAAC 840 

TTCTGGTATC ACTCTTATAC CAAGGGCAAG CGGCTGCCCC GTGCACTTCA GCAAAATGGA 900 

GCTCCAGGTA TTGCCAAGGT CAAGGCCAAC TGAGAAGCAT GGCCTAGATA GGCGCCCACC 960 

TAAGTGCCTC AGGACTTGCAC CTTAGGGCAG TGTCCGTCAG TGCCCTCTCC ACCTACACCT 1020 

GTGACCAAGG CTrATGTGGT CAGGACTGAG CAGGGGACTG GCCCTCCCXTT CCCCACAGCT 1080 

GCTCTACAGG GACCACGGCT TTGGTTCCTC ACCCACTTCC CCCGGGCAGC TCCAGGGATG 1140 

TGGCCTCATT GCTGTCTGCC ACTCCAGAGC TGGGGGCTAA AAGGGCTGTA CAOrTATTTC 1200 

_CCCCTCanX3_JCCOTAAAACT_TGGGAGAGGA^^^^ 1260" 

GTGGCCTTTT TCCTCACACA GAAGAGGTCA GCAATAATGT CACTGTGGAC CCAGTCTCAC 1320 

TCCTCCACCC CACACACTGA AGCAGTAGCT TCTGGGCCAA AGGTCAGGGT GGGCX3GGGGC 1380 

CTGGGAATAC AGCCTGTGGA GGCTGCXTAC TCAACTTGTG TCTTAATTAA AAGTGACAGA 1440 

GGAAACCAAA AAAAAAAAAA AAAAACTCGA GGGGGGCCCG TA 1482 



(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:
GGCACGAGCG CCGTTGCCCG CCTGGCCCCT ACGGAGTCCT TAGCCAGGAT GGAGGCTGTT 60
GTGAACTTGT ACCAAGAGGT GATGAAGCAC GCAGATCCCC GGATCCAGGG CTACXrCTCTG 120
ATGGGGTCCC CCTTGCTAAT GACCTCCATT CTCCTGACCT ACGTGTACTT CGTTCTCTCA 180
CTTGGGCCTC GCATCATGGC TAATCGGAAG CCCTTCCAGC TCCGTGGCTT CATGATTGTC 240
TACAACTTCT CACTGGTGGC ACTCTCCCTC TACATTGTCT ATGAGTTCCT GATGTCGGGC 300
TGGCTGAGCA CCTATACCTG GCGCTGTGAC CXrrCAGGACT GCACCTTAGG GCAGTGTCCG 360
TCAGTGCCCT CTCCAMCTAC ACXTTCTGACC AAGGCTTATG TGGTCAGGAC TGAGCAGGGG 420
ACTGGCCCTC CCCTCCCXIAC AGCTGCTCTA CAGGGACCAC GGCTTTGGTT CXTTCACXXrAC 480
TTCCCCCX3GG CAGCTCCAGG GATGTGGCCT CATTGCTGTC TGCCACTCCA GAGCTGGGGG 540
CTAAAAGGGC TGTACAGTTA TTTCCCCX^H:: CCTGCCTTAA AACTTGGGAG AGGAGCACTC 600
AGGGCTGGCC CCACAAAGGG TCTCGTGGCC TTTTTCCTCA CACAGAAGAG GTCAGCAATA 660
ATGTCACTGT GGACCCAGTC TCACTCCTCC ACCCCACACA CTGAAGCAGT AGCTTCTGGG 720
CCAAAGGTCA GGGTGGGCGG GGGCCTGGGA ATACAGCCTG TGGAGGCTGC TTACTCAACT 780
TGTGTCTTAA TTAAAAGTGA CAGAGGAAAC CACGAAAAAA AAAAAAAAAA AAAA 834



(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

TTGAACTTTT AAAATTTTAG ATCAGCAAAC TCTAAGATCC TAGAATGGAA GCTGTTCCTC 60

ATTTCTCCAT GCTCACCCTC CCAGGTCAGC GAGATGGTGA AGAAGCTGCA CGCGGCAACA 120

CCACCAACGT TCGGAGTQGA CCTCATCAAT GAGCTTGTGG AGAACTTTGG CAGATGTCCC 180

AAGTGGTCTG GTCGGCAAGC CTTTGTCTTT GTCTGCCAGA CTGTCATTGA GGATGACTGC 240

CrrCCCATGG ACCAGTTTGC TGTGCATCTC ATGCCGCATC TGCTAACCTT AGCAAATGAC 300

AGGGrrCCTA ACGTGCGAGT GCTGCTTGCA AAGACATTAA GACAAACTCT ACTAGAAAAA 360

GACTATTTCT TGGCCTCTGC CAGCTGCCAC CAGGAGGCTG TGGAGCAGAC CATCATGGCT 420

CTTCAGATGG ACCGTGACAG CGATGTCAAG TATTTTGCAA GCATCCACCC TGCCAGTACC 480

AAAATCTCCG AAGATGCCAT GAGCACAGCG TCCTCAACCT ACTAGAAGGC TTGAATCTCG 540

GTGTCTTTCC TGCTTCCATG AGAGCCGAGG TTCAGTGGGC ATTCGCCACG CATGTGACCT 600
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GGGATAGCTT TCGGGGGAGG AGAGACCTTC CTCTCCTGCG GACTTCATTG CAGGTX3CAAG 660 

TTGCCTACAC CCAATACCAG GGATTTCAAG AGTCAAGAGA AAGTACAGTA AACACTATTA 720 

TCTTATCTTG ACTTTAAGGG GAAATAATTT CTCAGAGGAT TATAATTGTC ACCGAAGCCT 780 

TAAATCCTTC TGTCTTCCTG ACTGAATGAA ACTTGAATTG GCAGAGCA1T TTCCTTATGG 840 

AAGGGATGAG ATTCCCAGAG ACCTGCATTG CTTTCTCCTG GTTTTATTTA ACAATCGACA 900 

AATGAAATTC TTACAGCCTG AAGGCAGACG TGTGCCCAGA TGTGAAAGAG ACCTTCAGTA 960 

TCAGCCCTAA CTCTTCTCTC CCAGGAAGGA CTTGCTGGGC TCTGTGGCCA GCTGTCCAGC 1020 

CCAGCCCTGT GTGTGAATCG TTTGTGACGT GTGCAAATGG GAAAGGAGGG GTTTTTACAT 1080 

CTCCTAAAGG ACXTTGATGCC AACACAAGTA GGATTGACTT AAACTCTTAA GCGCAGCATA 1140 

TTGCTGTACA CATTTACAGA ATGGTTGCTG AGTGTCTGTG TCTGATTTTr TCATGCTGGT 1200 

CATGACCTGA AGGAAATTTA TTAGACGTAT AATGTATGTC TGGTGTTOTT AACTTGATCA 1260 

TGATCAGCTC TGAGGTGCAA CTTCTTCACA TACTGTACAT ACCTGTGACC ACTCTTGGGA 1320 

GTGCTGCAGT CTTTAATCAT GCTGTTTAAA CTGTTGTGGC ACAAGTTCTC TTGTCCAAAT 1380 

AAAATTTATT AATAAGATCT ATAGAGAGAG ATATATACAC TTTTGATTGT TTTCTAGATG 1440 

TCTACXIAATA AATGCAATTT GTGACCTGTA TTAAAAAAAA NTAAAAAAAC TCGAGGGGGG 1500 

CCCGGTAC 1508 




(2) INFORMATION FOR SEQ ID NO: 255: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2514 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 


GAGAGACTCA CACTTCTTTT CCATTATCAC TGACGATGTA GTGGACATAG CAGGGGAAGA 60 

GCACCTACCT GTGTTGGTGA GGTTTGTTGA TGAATCTCAT T^ACCTAAGAG AGGAATTTAT 120 

50 AGGCTTCCTG CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 180 

AACTGAGAAG TGGGGATTAA ATATGGAGTA TTGTCGTGGC CAGGCTTACA TTGWCTCTAG 240 

TGGATTTTCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TTAGAGAAAT ATCCCCAAGC 300 


TATCTACACA CTCTGCTCTT CCTGTGCCTT AAATATGTGG TTGGCAAAAT CAGTACCTGT 360 

TATGGGAGTA TCTGTTGCAT TAGGAACAAT TGAGGAAGTT TGTTCTTTTT TCCATCGATC 420 

60 ACCACAACTG CTTTTAGAAC TTGACAACGT AATTTCTGTT CTTTTTCAGA ACAGTAAAGA 480 
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AAGGGGTAAA GAACTGAAGG AAATCTGCCA TTCTCAGTGG ACAGGCAGGC ATGATGCTTT 540 

TGAAATTTTA GTGGAACTCC TGCAAGCACT TCTTTTATGT TTAGATGGTA TAAATAGTGA 600 


CACAAATATT AGATGGAATA ACTATATAGC TGGCCX3AGCA TTTGTACTCT GCAGTGCAGT 660 

GTCAGATTTT GATTTCATTG TTACTATTGT TGTTCTTAAA AATGTCCTAT CTTTTACAAG 720 

10 AGCCTTTGGG AAAAACCTCC AGGGGCAAAC CTCTGATGTC TTCTTTGCGG CX:GGTAGCTT 780 

GACTGCAGTA CTGCATTCAC TCAACGAAGT GATTGGAAAA TATTGAAC3TT TATCATGAAT 840 

TTTGGTTTGA GGAAGCCACA AATTTGGCAA CX::AAACTTGA TATTCAAATG AAACTCCCTG 900 


GGAAATTCCG CAGAGCTCAC CAGGGTAACT TGGAATCTCA GCTAACCTCT GAGAGTTACT 960 

ATAAAGAAAC CCTAAGTGTC CCAACAGTGG AGCACATTAT TCAGGAACTT AAAGATATAT 1020 

20 TCTCAGAACA GCACCTCAAA GCTCTTAAAT GCTTATCTCT GGTACCCTCA GTCATGGGAC 1080 

AACTCAAATT CAATACGTCG GAGGAACACC ATGCTGACAT GTATAGAAGT GACTTACCCA 1140 

ATCCTGACAC GCTGTCAGCT GAGCTTCATT GTTGGAGAAT CAAATGGAAA CACAGGGGGA 1200 


AAGATATAGA GCTTCCX5TCC ACCATCTATG AAGCCCTCCA CCTGCCTGAC ATCAAGTTTT 1260 

TTCCTAATGT GTATGCATTG CTGAAGGTCC TGTGTATTCT TCCTGTGATG AAGGTTGAGA 1320 

30 ATGAGCQGTA TGAAAATGGA CGAAAGCX5TC TTAAAGCATA TTTGAGGAAC ACTTTGACAG 1380 

ACCAAAGGTC A/^AACTTG GCTTTGCTTA ACATAAATTT TGATATAAAA CACGACCTGG 1440 

ATTTAATGGT GGACACATAT ATTAAACTCT ATACAAGTAA GTCAGAGCTT CCTACAGATA 1500 


ATTCCGAAAC TGTGGAAAAT ACCTAAGAGA CTTTTAAAAA TAGGCTTTCT TATATTTGAT 1560

ATTTQGAAGA AAAAGCCGTA AGTGTATGTA GACCACTTAA TCACTAAATA TCTTTGCCTA 1620 

40 TAGGACTCCA TTGAATACAT TAGCCATTGA TAATCTACCT GTTTAAATGG CCCCTGTTTG 1680 

AACTCTCAAG CTTTGAAGAC CTACCTGTTC TTCCAGAAGA GAACGTTGAA AGTGCCATGT 1740 

TTCCTTTTGC GTGATCTCTC TTGATGGCAC TCTGGAATTG TTTCAGTTAA GTCATTTTAG 1800 


ACATAGCATT TATTATCACT GTGGATCTCT ACTTGTTGGG TGTTATGAAT TCTTTGAAGA 1860 

AATATATTTT GAAGAGGTGT GGGAGGAAGG AATACATTTT ATAAAATGTT GTAGTGAAGC 1920 

50 CCACAATTGA CCTTTGACTA ATAGGAGTTT TAAGTATGTT AAAAATCTAT ACTGGACAGT 1980 

TACAAGAAAT TACCX3GAGAA AAGCTTGTGA GCTCACCAAA CAAGGATTTC AGTGTAGATT 2040 

TTGTCTTTCT TGAACTTAAA GAAACAAATG ACAAAGITTG AATGGAAAAG CCTGCTGTTG 2100 


TTCCACATCT CGTTGCTGTT TACATTCCTT TGTQGAGCCT ACATCTTCCT AAGCTTTTTA 2160 

GCAGGTATAT GTTGAACACT TCTGTTTCAT GGTTGAGACA GAATCAGAGG CCATGGATAC 2220 

60 TGACAACTGA TTTGTCTGTT TTTTTTCTCT GTCTTTTTCC ATGACTCTTA TATACTGCCT 2280 
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CATCTTGATT TATAAGCAAA ACCTGGAAAA CCTACAAAAT AAGTGTTGTG GTTTATCTAG 2340 

AAAAATATGG AAAATATTGC TGTTATTTTT GGTGAAGAAA ATCAATTTTG TATAGTTTAT 2400 

TTCAATCTAA ATAAAATGTG AATTTTGTTT AAAGCTTAGG CACATTATTT TTTGTGGGGT 2460 

CAAAACATTC TTGTGTAAAT TCTCTTAAAC ATTTGATAAA CAGCTTCACA ATTC 2514 
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(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

CTGCCTTATG AAGCCGATGC AGAAATTTTG GCTGTGAAAT TTCT^CACTAT GATAACTGAG 60 

AAGTGGGGAT TAAATATGGA GTATTGTCGT GGCCAGGCTT ACATTGTCTC TAGTGGATTT 120 

TCTTCCAAAA TGAAAGTTGT TGCTTCTAGA CTTTTAGAGA AATATCCCCA AGCTATCTAC 180 

ACACTCTGCT CTTCCTGTGC CTTAAATATG TGGTTGGCAA AATCAGTACC TGTTATOGGA 240 

GTATCTGTTG CATTAGGAAC AATTGAGGAA GTTTGTTCTT TTTTCCATCG ATCACCACAA 300 

CTGCTTTTAG AACTTGACAA CGTAATTYCT GTTCTTTTTC AGAACAGTAA AGAAAGGGGT 360 

AAAGAACTGA AGGAAATCTG CCATTCTCAG TGGACAGGCA GGCATGATGC TTTTGAAATT 420 

TTAGTGGAAC TCCTGCAAGC ACTTGTTTTA TGTTTAGATG GTATAAATAG TGACACAAAT 480

ATTAGATGGA ATAACTATAT AGCTGGCCGA GCATTTGTAC TCTGCAGTGC AGTGTCAGAT 540 

TTTGATTTCA TTGTTACTAT TGTTGTTCTT AAAAATGTCC TATCTTTTAC AAGAGCCTTT 600 

GGGAAAAACC TCCAGGGGCA AACCTCTGAT GTCTTCTTTG CGGCCGGTAG CTTGACTGCA 660 

GTACTGCATT CACTCAACGA AGTGANTGGA AAATATTGAA GTTTATCATG AATTTTGGTT 720 

TGAGGAAGCC ACAAATTTGG CAACCAAACT TGATATTCAA ATGAAACTCC CTGOGAAATT 780 

CCGCAGAGCT CACCAGGGTA ACTTGGAATC TCAGCTAACC TCTGAGAGTT ACTATAAAGA 840 

AACCCTAAGT GTCCCAACAG TGGAGCACAT TATTCAGGAA CTTAAAGATA TATTCTCAGA 900 

ACAGCACCTC AAAGCTCTTA AATGCTTATC TCTGGTACCC TCAGTCATGG GACAACTCAA 960 

ATTCAATACG TCGGAGGAAC ACCATGCTGA CATGTATAGA AGTGACTTAC CCAATCCTGA 1020 

CACGCTGTCA GCTGAGCTTC ATTGTTGGAG AATCAAATGG AAACACAGGG GGAAAGATAT 1080 

AGAGCTTCCG TCCACCATCT ATGAAGCCCT CCACCTGCCT GACATCAAGT TTTTTCCTAA 1140 
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TGTGTATGCA TTGCTGAAGG TCCTGTGTAT TCTTCCTGTG ATGAAGGTTG AGAATGAGCG 1200 

GTATGAAAAT GGACGAAAGC GTCTTAAAGC ATATTTGAGG AACACTTTGA CAGACCAAAG 1260 

GTCAAGTAAC TTGGCTTTGC TTAACATAAA TTTTGATATA AAACACGACC TGGATTTAAT 1320 

GGTGGACACA TATATTAAAC TCTATACAAG TAAGTCAGAG CTTCCTACAG ATAATTCCGA 1380 

AACTGTGGAA AATACCTAAG AGACTTTTAA AAATAGGCTT TCTTATATTT GATATTTGGA 1440 

AGAAAAAGCC GTAAGTGTAT GTAGACCACT TAATCACTAA ATATCTTTGC CTATAGGACT 1500 

CCATTGAATA CATTAGCCAT TGATAATCTA CCTGTTTAAA TGGCXrCCTGT TTGAACTCTC 1560 

AAGCTTTGAA GACCTACCTG TTCTTCCAGA AGAGAACGTT GAAAGTGCCA TGTTTCCTTT 1620 

TGCGTGATCT CTGTTGATGG CACTCTGGAA TTGTTTCAGT TAAGTCATTT TAGACATAGC 1680 

ATTTATTATC ACTGTGGATC TCTACTTGTT GGGTGTTATG AATTCTTTGA AGAAATATAT 1740 

TTTGAAGAGG TGTGGGAGGA AGGAATACAT TTTATAAAAT GTTGTAGTGA AGCCCACAAT 1800 

TGACCTTTGA CTAATAGGAG TTTTAAGTAT GTTAAAAATC TATACTGGAC AGTTACAAGA 1860 

AATTACCGGA GAAAAGCTTG TGAGCTCACC AAACAAGGAT TTCAGTGTAG ATTTTGTCTT 1920 

TCTTGAACTT AAAGAAACAA ATGACAAAGT TTGAATGGAA AAGCCTGCTG TTGTTCCACA 1980 

TCTCGTTGCT GTTTACATTC CTTTGTGGAG CCTACATCTT CCTAAGCTTT TTAGCAGGTA 2040 

TATGTTGAAC ACTTCTGTTT CATGGTTGAG ACAGAATCAG AGGCCATGGA TACTGACAAC 2100 

TGATTTGTCT GTTTTTTTTC TCTGTCTTTT TCCATGACTC TTATATACTG CCTCATCTTG 2160 

ATTTATAAGC AAAACCTGGA AAACCTACAA AATAAGTGTT GTGGTTTATC TAGAAAAATA 2220 






TGGAAAATAT TGCTGTTATT TTTGGTGAAG AAAATCAATT TTGTATAGTT TATTTCAATC 2280 
TAAATAAAAT GTGAATTTTG TTTAAAGCTT AGGCACATTA TTTTTTGTGG GGTCAAAACA 2340 
TTCTTGTGTA AATTCTC 2357 
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(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 
ACTTTCTGGT GCAAAAAGAT GTTCAAGCCT TATTTTATAC TTGCCTGCCC CTTTCTCTTT 60
CATTTATTGG AGTGAGCTGC AGCTCTAAGA AGACCTGTTC TTTTGAATGG AGAGTAGCAT 120
CAGGAACCAG GATGTGGGTG CGAGGCGTGC TCCTGGCTGT TGCAGATTGC TGCACCCGGG 180
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AGCTCTTAGT GGACAGAGCT AGAGGATATG TGCACGTACT TCCATCTCTC TCTCTGTCTC 240 

CGATTTTAGC CCAGCACCAC AGGGTACGTT CCAGTTTTTC TCTCTTTCCA fAGCTGTAAG 300 


GCCCTTTCTG GGAATGGTTC TCATTCTCCT TAATCTATTA TTGOGTCAGT TTTCCTGCAT 360 

GTCCCXAGCC TCCCATCACT GCCACCCACT CCCCACAGAG ATGCCCTGCT CATCCX3ACTG 420 

10 GGGCTTTGAC TCCCACACTG TCTACCCXTTC TTGTGTGGAC GCCCTGCTGC CAAAACCTTC 480 

AGCAAACAGC TTTCCAAATG GAAGTTGTCA CTGTCARGGS CTTTACAATC AGCAACAGCA 540 

AAATCTACAT GCTGCTGAGG GTCCTGCCTC ATTAAGATGC AATAAATATG TAAGTACATA 600 


AAAACAGCAA TAGAAGAAAC GTAATGCTTT ATTCTCAAAT ATGNATGTCT ACATAGAAAA 660 

GCCAAAATTA TTAAGAATAG TAAGGAATT 689 




(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 



TCGACCCACG CGTCCGCCGA TGTGATGATT CCTGCGTATT CCAAGAACCG GGCCTATGCC 60 

35 ATCTTCTTCA TAGTCTTCAC TGTGATAGGG GACGCCCCCG GCGCTGTGCT ATCCTGTGCC 120 

GGCCACCCTX-X3CGTTGGTTT TGGTGGTGTA GTGG^^^ 180

TCTTGAAGGA AGCCTGTTTC TGATGAACCT GCTGACAGCC ATCATCTACA GTCAGTTCCG 240 


GGGCTACCTG ATGAAATCTC TCCAGACCTC GCTGTTTCGG AGGCGGGTGG GAACCCGGCT 300 

GCCTTTGAAG TCCTATCCTC CATGGTGGGG GAGGGAGGAG CCTTCCCTCA GGCAGTTGGG 360 

45 GTGAAGCCCC AGAACTTGCT GCAGGTGCTT CAGAAGGTCC AGCTGGACAG CTCCCACAGA 420 

CAGGCCATGA TGGAGAAGGT GCGTTCCTAT GGCAGTGTTC TGCTCTCAGC TGAGGAGTTT 480 

CAGAAGCTCT TCAACGAGCT TGACAGAAGT GTGGTTAAAG AGCACCCGCC GAGGCCCGAG 540 


TACCAGTCTC CGTTTCTGCA GAGCGNCCCA GTTCCTCTTC GGCCACTNAC TACTTTGACT 600 

ACCTGGGGAA CCTCATCGCC CTGGCAAACC TGGTGTCCAT TTGCGTGTTC CTGGTGCTGG 660 

55 ATGCAGATGT TGCTGCCTGC TGAGCGTGAT GACTTCATCC TGGGGGGTCT CAACTGCGTC 720 

TTCATTGTGT ACTACCTGTT GGAGATGCTG GCTCAAGGTC TTTTGCCCTG GGGCCTGCGA 780 

RGGTACYKKT CCTAACCCCA RCAAMGTGTT TTGAACGGGC TCCTCAMCGT TTGTCCTGGC 840 
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TGGWWKKGSM GATCTCAACT CTGGCTGTGT ACCGATTGCC ACACCCAGGC TGGAGGCCGG 900 

ANATGGTGGG CCTGCTGTCG CTGTGGGACA TGACCCGCAT ACTGAACATG CTCATCGTGT 960 

TCCGCTTCCT GCGTATCATC CCCAGCATGA AGCCGATGGC CGTGGTGGCC AGTACCGTCC 1020 

TGGGCCTGGT GCAAAACATG CGTGCGTTTG GCXXX3ATCCT GGTGGTOGTC TACTACGTAT 1080 

TTGCCATCAT TGGGATCAAC TTGTTTAGAG GCGTCATTGT GGCTCTTCCT GGAAACAGGA 1140 

GCCTGGCCCC TGCCAATAGG TCGGCGCCCT GTGGGAGCTT CGAGCAGCTG GAGTACTGGG 1200 

CCAACAACTT CGATGACTTT GCGGCTGCCC TGGTCACTCT GrGGAACTTG ATGGTGGTGA 1260 

ACAACTGGCA GGTCTTTCTG GATGCATATC GGCGCTACTA AGGCCCGTGG TCCAAGATCT 1320 

ATTTTGTATT GTGGTGGCTG GTGTCGTCTG TCATCTGGGT CAACCTGTTT CTQGCCCTGA 1380 

TTCTGGAGAA CTTCCTTCAC AAGTGGGACC CCCGCAGCCA CCTGCAGCCC CTTGCTGGGA 1440 

CCCCA3AGGC CACCTACCAG ATGACTGTGG AGCTCCTGTT CAGGGATATT CTGGAGGAGC 1500 

CCGGGGAGGA TGAGCTCACA GAGAGGCTGA GCCAGCACCC GCACCTGTGG CTGTGCAGGT 1560 

GACGTCCGGG TCTGCCATCC CAGCAGGGGC GGCAGGAGAG AGAGGCTGGC ATAACACAGG 1620 

TGCCCATCAT GGAAGAGGCG GCCATGCTGT GGCCAGCCAG GCAGGAAGAG ACCTTTCCTC 1680 

TGACGGACCA CTAAGCTGGG GACAGGAACC AAGTCCTTTG CGTGTGGCCC AACAACCATT 1740 

TACAGAACAG CTGCTGGTGC TTCAGGGAGG CX3CCGTGCCC TCCX3CTTTCT TTTATAGCTG 1800 

CTTCAGTGAG AATTCCCTTG TCGACTCCAC AGGGACCTTT CAGACAAAAA TGCAAGAAGC 1860 

AGCGGCCTCC CCTGTCCCCT GCAGCTTCCG TGGTGCCTTT GCTGCCGGCA GCCCTTGGGG 1920 



-ACCACAGGCC -TGAXXAGGGC-CTGCAOi^ CATTCAGCTG 1980 

GGAATGATAC TAATACCTCC GATTTTAGCC CAGCACCACA GGGTACGrTTC CAGTTTTTAT 2040 

TTCTTTCCAT AGCTGTAAGG CCCTTTCTGG GAATGGTTAT CATTCTCCTT AATCTATTAT 2100 

TGGGTCAGTT TTCCTGCATG TCCCCAGCCT CXICATCACTG CCACTCACTC CCCACAGAGA 2160 

TGCCCTGCTC ATCCX5ACTGG GGCTTTGACT CCCACACTGT GTACCCCTCT TGTGTGGACG 2220 

CCCTGCTGCC AAAACXTTTCA GCAAACAGCT TTCCAAATGG AAGTTGTCAC TGTCAGGGCC 2280 

TTTACAATCA GCAACAGCAA AATCTACATG CTGCTGAGGG TCCTGCCTCA TTAAGATGCA 2340 

ATAAATATGT AAGTACATAA AAAAAAAAAA AAAAAAA 2377 






(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 


TCTGNTCGCC GTCGCCCCGC CCCTGGCCTT TGCCCGGTCG GGCGGGACTT CCTGTGTCGT 60 

ATTTCCAAGG ACTCCAAAGC GAGGCCGGGG ACTGAAGGTG TGGGTGTCGA GCCCTCTGGC 120 

10 AGAGGGTTAA CCTGGGTCAA ATGCACGGAT TCTCACCTCG TACAGTTACG CTCTCCCGCG 180 

GCACGTCCGC GAGGMYTTGA AGTCCTGAGC GCTCAAGTTT GTCCGTAGTC GAGAGAAGGC 240 

CATGGAGGTG CCGCCACCGG CACCGCGGAG CTTTCTCTGT AGAGCATTGT GCCTATTTCC 300 


CCGAGTCTTT GCTGCCGAAG CTGTGACTGC CGATTCGGAA GTCCTTGAGG AGCGTCAGAA 360 

GCGGCTTCCC TACGTCCCAG AGCCCTATTA CCCGGAATCT GGATGGGACC GCCTCCGGGA 420 

20 GCTGrrTGGC AAAGACACAG TGAACACTAG TCTGAATGTA TACCGAAATA AAGATGCCTT 480 

AAGCCATTTT GTAATTGCAG GAGCTGTCAC GGGAAGTCTT TTTAGGATAA ACGTAGGCCT 540 

GCGTGGCTGG TGGCTGGTGG CATAATTGGA GCCTTGCTGG GCACTCCTGT AGGAGGCCTG 600 


CTGATGGCAT TTCAGAAGTA CTCTGGTGAG ACTGTTCAGG AAAGAAAACA GAAGGATCGA 660 

AAGGCACTCC ATGAGCTAAA ACTGGAAGAG TGGAAAGGCA GACTACAAGT TACTGAGCAC 720 

30 CTCCCTGAGA AAATTGAAAG TAGTTTACAG GAAGATGAAC CTGAGAATGA TGCTAAGAAA 780 

ATTGAAGCAC TGCTAAACCT TCCTAGAAAC CCTTCAGTAA TAGATAAACA AGACAAGGAC 840 

TGAAAGTGCT CTGAACTTGA AACTCACTGG AGAGCTGAAG GGAGCTGCCA TGTCCGATGA 900 


ATGCCAACAG ACAGGCCACT CTTTGGTCAG CCTGCTGACA AATTTAAGTG CTGGTACCTG 960 



TGGTGGCAGT GGCTTGCTCT TGTCTTTTTC TTTTCTTTTT AACTAAGAAT GGGGCTGTTG 1020 

40 TACTCrCACT TTACTTATCC TTAAATTTAA ATACATACTT ATGTTTGTAT TAATCTATCA 1080 

ATATATGCAT ACATGAATAT ATCCACCCAC CTAGATTTTA AGCAGTAAAT AAAACATTTC 1140 

GCAAAAGATT AAAGTTGAAT TTTACAGTTA AAAAAAAAAA AAAAAAAAAA AAA 1193 




(2) INFORMATION FOR SEQ ID NO: 260: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:
GAAAAACCCA AAGATGCAGA CAATCTCTTT GAACATGAAT TGGGGGCTCT CAATATGGCT 60 
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GCATTACTAC GAAAAGAAGA AAGAGCAAGT CTTCTTAGTA ATCTTGGCCC ATGTTGTAAG 120 

GCGTTGTGCT TCAGACX3GGA TTCTGCAATT CGAAAGCAGC TTGTTAAAAA TGAGAAGGGC 180 

_ 5 ACCATAAAAC AAGCTTACAC GAGTSCTCCA ATGGTAGACA ATGAATTACT TCGATTGAGT 240 

CTTCGGTTAT TTAAGCGGAA GACTACTTGC CATGCTCCAG GACATGAAAA GACTGAAGAT 300 

AATAAACTTT CACAGTCCAG TATCCAACAG GAACTGTGTG TGTCTTAAGA CCGAAGTTCA 360 


ATATGGTATT TTTGGTACTG TCTTCCTTCA GCAGTGCATA TTCTTTTGCA AAGTTCTTTG 420 

GTTTGACAAG CATTAGTGAC AAAGGCAGAA AAGATTTATC AGCCATGCTA AAAGAGTGAA 480 

15 GAATTTTGAT CTTTAGAGAC ACTAGTTTTC GCCAACTTAA GATTTTACGT TAATTTTTAC 540 

ATAGTATTTG ACACTCATGC AAAATAATGT GAAAACATCT AGATTTAGTA GTTTATTCTG 600 

CGCCTTTTGT TAAAACTGAA GATITTGGAA AATGGTTGTC ACTGCTCTTC CAGCCTATGA 660 


ATATTTTTGT GAAATGGAAC CATGGATTTA TGTCTGGATC ATCCATACAG AACCAACAAT 720 

TTTA'rrCAAA AACAATGTGT TCATCAAAGT AATTGCTCAC ATTGTGCAGT ACTATGTTGT 780 

25 ACAGACCACG TGAAAGGGAA TGCTGGTCTA GCTGGCGTGG TATGTTTATA GGCGAATTTC 840 

AGCAGAAGGA AGCCAAAATA GTTTTTTCCT TTTGAAAGTT TTTTAAAAAT TATTTCATGG 900 

GTCTTTTTTT TAATTAATAT GTGTGCATTG TTACAATGTA TGTTGGATGT CTTTTGACCC 960 


TAAATGCTTT TTTTGTTATC AGAGATTGTG TACTATTTTT ATTTTTAATA AATGTATCTT 1020 

CCCTTTCCTT GTTTTAGATT TACTTTGCTC TTCGTTAATC TTATTCCTGA TGATCTAGAA 1080 

35 CATTAGTCAT CAACATTACA TGTTTCATGC TTCAGATATT TTACTGCTTG TGTCCTTATT 1140 






GTTGGACAGC TTTAAACAGA GTTGATGGTA CTTCAAATAT AGCTCAITGA TACTTAAGGG 1200 
CANCTTCCTT GGGATGTQGG CTTTTTGGAA GGAAAAAAAT TNCCXTCAAAG GCAAATCCCA 1260 
GT 1262 




(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1179 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 


GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 
GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 
CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGQG CTACAAGGAC 180
CTGGACCTCA TCTTCCTGCG CCGACCTGCG CGGGGAAGGG GAGrTTTCAGA CTGTGAAGGA 240
CGTCGTGCTG GACTGCCTGT TGGACTTCTT ACCCX3AGGGG GTGAACAAAG AGAAGATCAC 300
ACCACTCACG CTCAAGGAAG CrTATGTGCA GAAAATGGTT AAAGTGTCCA ATGACTCTGA 360
CCGATGGAGT CTTATATCCC TGTCAAACAA CAGTGGCAAA AATGTGGAAC TGAAATTTGT 420
GGATTCCXrrC CGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 480
TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCCACAAT 540
AATCGGGGAG AGCGTCTATG GCGATTTCCA GGAAGCCTTT GATCACXTTT GTAACAAGAT 600
CATTGCCACC AGGAACCCAG AGGAAATCCG AGGGGGAGGC CTGCTTAAGT ACTGCAACCT 660
CTTGGTGAGG GGCTTTAGGC CCGCCTCTGA TGAAATCAAG ACCCTTCAAA GGTATATGTG 720
TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TGGAGTCCTA 780
TTTGCAGAAC CACTTTGrGG GATTGGAAGA CCGCAAGTAT GAGTATCTCA TGACCCTTCA 840
TGGAGTGGTA AATGAGAGCA CAGTGTGCCT GATGGGACAT GAAAGAAGAC AGACTTTAAA 900
CCTTATCACC ATGCTGGCTA TCCGGGTGTT AGCTGACCAA AATGTCATTC CTAATGTGGC 960
TAATGTCACT TGCTATTACC AGCCAGCCCC CTATGTAGCA GATGCCAACT 1TAGCAATTA 1020
CTACATTGCA CAGGTTCAGC CAGTATTCAC GTGCCAGCAA CAGACXTTACT CCACTTGGCT 1080
ACCCTGCAAT TAAGAATCAT TTAAAAATGT CCTGTGGGGA AGCCATTTCA GACAAGACAG 1140
GAGAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAGAGC 1179

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1162 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262:
GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60
GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTOCGCGAA CGTTGGCCCT 120
CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180
CTGGACCTCA TCTTCCTGCG CCGACCTGCG CGGGGAAGGG GAGTTTCAGA CTGTGAAGGA 240
CGTCGTGCTG GACTGCCTGT TGGACTTCTT ACCCGAGGGG GTGAACAAAG AGAAGATCAC 300
ACCACTCACG CTCAAGGAAG CTTATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 360






wo 98/39448 



PCT/US98/04493 






CCGATGGAGT CTTATATCCC TGTCAAACAA CAGTGGCAAA AATGTGGAAC TGAAATTTGT 420
GGATTCCCTC CGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 480
TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC T^^CCCACAAT 540
AATGGGGGAG AGCGTCTATG GCX3ATTTCCA GGAAGCCTTT GATCACCTTT GTAACAAGAT 600
CATTGCCACC AGGAACCCAG AGGAAATCCG AGGGGGAGGC CTGCTTAAGT ACTGCAACCT 660
CTTGGTGAGG GGCTTTAGGC CCX3CCTCTGA TGAAATCAAG ACCCTTCAAA GGTATATGTG 720
TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TGGAGTCCTA 780
TTTGCAGAAC CACTTTGTGG GATTGGAAGA CCGCAAGTAT GAGTATCTCA TGACCmCA 840
TGGAGTGGTA AATGAGAGCA CAGTGTGCCT GATGGGACAT GAAAGAAGAC AGACTTTAAA 900
CCTTATCACC ATGCTGGCTA TCCGGGTGTT AGCTGACCAA AATGTCATTC CTAATGTGGC 960
TAATGTCACT TGCTATTACC AGCCAGCCCC CTATGTAGCA GATGCCAACT TTAGCAATTA 1020
CTACATTGCA CAGGTTCAGC CAGTATTCAC GTGCCAGCAA CAGACCTACT CCACTTGGCr 1080
ACCCTGCAAT TAAGAATCAT TTAAAAATGT CCTGTGGGGA AGCCATTTCA GACAAGACAG 1140
GAGAGAAAAA NAAMGAAAAG AG 1162
(2) INFORMATION FOR SEQ ID NO: 263:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 735 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263:

CGGGCTGGGT ATTTGCCTCG CACCATGGCG CCCAAGGGCA AAGTGGGCAC GAGAGGGAAG 60 

AAGCAGATAT TTGAAGAGAA CAGAGAGACT CTGAAGTTCT ACCTGCGGAT CATACTGGGG 120 

GCCAATGCCA TTTACTGCCT TGTGACGTTG GTCTTCTTTT ACTCATCTGC CTCATTTTGG 180 

GCCTGGTTGG CCTTGGGCTT TAGTCTGGCA GTGTATGGGG CCAGCTACCA CTCTATGAGC 240 

TCGATGGCAC GAGCAGCGTT CTTCTGAGGA TGGGGCCCTG ATGGATGGTG GCACGAGCTC 300 

AACATGGAGC AGGGCATGGC AGAGCACCTT AAGGATGTGA TCCTACTGAC AGCCATCGTG 360 

CAGGTGCTCA GCTGCTTCTC TCTCTATGTC TGGTCCTTCT GGCTTCTGGC TCCAGGCCGG 420 

GCCCTTTACC TCCTGTGGGT GAATGTGCTG GGCCCCTGGT TCACTGCAGA CAGTGGCACC 480 

CCAGCACCAG AGCACAATGA GAAACGGCAG CGCCGACAGG AGCGGCGGCA GATGAAGCGG 540 

TTATAGCCAT TGACATTGTG GCCACAGGCC ACTGGCCCTG GGTGGCTCTG TCAGGGTGCA 600 
CAGCCCCTCA TGCCTGGAGC AATGAGGGTC TAGTCCAGGG GCCAAAAGCA GTCTGAGGTA 660

TTGGGTATAC TTATACTCTA TAGGGTCGTT GAATAAATGG CTTAGAATGT GAAAAAAAAA 720 

AAAAAAAAAA ATTTT 735 

(2) INFORMATION FOR SEQ ID NO: 264:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

AAGTGCATGA GCTGCCGATG TGGTGCTTAG TGATTGCGGT TTCGGTCGCT CTCCCGTGTT 60 

TCCCGGGCTG GGTATTTGCC TCGCACCATG GCGCCCAAGG GCAAAGTGGG CACGAGAGGG 120 

AAGAAGCAGA TATTTGAAGA GAACAGAGAG ACTCTGAAGT TCTACCTGCG GATCATACTG 180

GGGGCCAATG CCATTTACTG CCTTGTGACG TTGGTCTTCT TTTACTCATC TGCCTCATTT 240 

TGGGCCTGGT TGGCCTGGGC TTTAGTCTGG CAGTGTATGG GGCCAGCTAC CACTCTATGA 300 


GCTCGATGGC ACGAGCAGCG TTCTCTGAGG ATGGGGCCCT GATGGATGGT QGCATGGACC 360 
TCAACATGGA GCAGGGCATG GCAGAGTGAG TGTCCCCCAC CGCCAGCCCA GGCACCTTAA 420 
GGATGTGATC CTACTGACAG CCATCGTGCA GGTGCTCAGC TGCTTCTCTC TCTATGTCTG 480
GTCCOnxnxXi CTTCTGGCTC XAGGCCGGGC CC^^ GTGTGGGTGA ATGTGCrGGG 540



CCCCTGGTTC ACTGCAGACA GTGGCACCCC AGCACCAGAG CACAATGAGA AACQGCAGCG 600 

CCGACAGGAG CGGCGGCAGA TGAAGCGGTT ATAGCCATTG ACGATTTKGC SACNRGCCAC 660 

TGGCCCTGGG TGGCTCTGTC AGGGTGCACA GCCCCTCATG CCTQGAGCAA TGAGGGTCTA 720 

GTCCAGQGGC CAAAAGCAGT CTGAGGTATT GGGTATACTT ATACTCTATA GGGTCGTTGA 780

ATA 783 



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1638 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:
GGCACGAGGC GGCGGCAGCG GTGGCGGCXX3 CGCCCCCOGG CX3GGAGCCGT NCCCTTTCCC 60

GTCGGGGAGC GCGGGGYCGG GGYCXIAGGGG ANCCCGGGMC ACGGAGAGCG GGAAGAGGAT 120 


GGATTGCCCG GCCCTCCCCC CCGGATGGAA GAAGGAGGAA GTGATCCGAA AATCTGGGCT 180 

AAGTGCTGGC AAGAGCGATG TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 240 

GCCTCAGTTG GCAAGGTACC TGGGAAATAC TGTTGATCTC AGCAGTTTTG ACTTCAGAAC 300

TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTGCGAA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA TACAACATTG CCAATTAGAC AAACAGCATC 420 


AATTTTCAAA CAACCGGTAA CCAAAGTCAC AAATCATCCT AGTAATAAAG TGAAATCAGA 480 

CCCACAACGA ATGAATGAAC AGCCACGTCA GCTTTTCTGG GAGAAGAGGC TACAAGGACT 540 

TAGTGCATCA GATGTAACAG AACAAATTAT AAAAACCATG GAACTACCCA AAGGTCTTCA 600

AGGAGTTGGT CCAGGTAGCA ATGATGAGAC CCTTTTATCT GCTGTTGCCA GTGCTTTGCA 660 

CACAAGCTCT GCGCCAATCA CAGGGCAAGT CTCCGCTGCT GTGGAAAAGA ACCCTGCTGT 720 


TTGGCTTAAC ACATCTCAAC CCXTTCTGCAA AGCTTTTATT GTCACAGATG AAGACATCAG 780 

GAAACAGGAA GAGCGAGTAC AGCAAGTACG CAAGAAATTG GAAGAAGCAC TGATGGCAGA 840 

CATCTTGTCG CGAGCTGCTG ATACAGAAGA GATGGATATT GAAATGGACA GTGGAGATGA 900

AGCCTAAGAA TATGATCAGG TAACTTTCGA CCGACTTTCC CCAAGAGAAA ATTCCTAGAA 960 

ATTGAACAAA AATGTTTCCA CrGGCTTTTG CCTGTAAGAA AAAAAATGTA CCCGAGCACA 1020 


TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTTGTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGGAAATG 1200

TAGCACTTAC GTAAAACATT TGTTTCXX:CC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 


CTTTATTGAA AGGGGACACC TGTACATTCT TCCATCRTCA CTGTAAAGAC AAATAAATGA 1380 

TTATATTCAC AGACTGATTG GAATTCTTTC TGTTGAAAAG CACACACAAT AAAGAACCCC 1440 

TCGTTAGCCT TCCTCTGATT TACATTCAAC TCTGATCCCG GGGCCTT/^ TTTGACATGG 1500

GAGGTGGGAG GAAGATAGCG CATATATTTG CAGTATGAAC TATTGCCTCT GGGACGTTGT 1560 

GAGGAATTGT GCTTTCACCA GAATTTCTAA GGATTTCTGG CTTAAATATC ACCTAGCXTTG 1620 


TGGTAATTTT TTTTCCXTT 1638 

(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
CGTGCGTACT GCCATGCAGG TACCQGGTCC GGAATTCCCA GGGTCGACCC ACGCGTCCGC 60 
TCAGTTGGCA AGGTACCTGG GAAATACTGT TGATCTCAGC AGTTTTGACT TCAGAACTGG 120 

AAAGATGATG CCTAGTAAAT TACAGAAGAA CAAACAGAGA CTGCGAAACG ATCCTCTCAA 180 

TCAAAATAAG GGTAAACCAG ACTTGAATAC AACATTGCCA ATTAGACAAA CAGCATCAAT 240 
TTTCAAACAA CCGGTAACCA AAGTCACAAA TCATCCTAGT AATAAAGTGA AATCAGACCC 300 
ACAACGAATG AATGAACAGC CACGTCAGCT TTTCTGGGAG AAGAGGCTAC AAGGACTTAG 360 

TGCATCAGAT GTAACAGAAC AAATTATAAA AACCATGGAA CTACCCAAAG GTCTTCAAGG 420 
AGTTGGTCCA GGTAGCAATG ATGAGACCCT TTTATCTGCT GTTGCCAGTG CTTTGCACAC 480 
AAGCTCTGCG CCAATCACAG GGCAAGTCTC CGCTGCTGTG GAAAAGAACC CTGCTGTTTG 540
GCTTAACACA TCTCAACCCC TCTGCAAAGC TTTTATTGTC ACAGATGAAG ACATCAGGAA 600 
ACAGGAAGAG CGAGTACAGC AAGTACGCAA GAAATTGGAA GAAGCACTGA TGGCAGACAT 660 
CTTGTCGCGA GCTGCTGATA CAGAAGAGAT GGATATTGAA ATGGACAGTG GAGATGAAGC 720 
CTAAGAATAT GATCAGGTAA CTTTCGACCG ACTTTCCCCA AGAGAAAATT CCTAGAAATT 780 

GAACAAAAAT GTTTCCACTG GCTTTTGCCT GTAAGAAAAA AAATGTACCC GAGCACATAG 840

AGCTTTTTAA TAGCACTAAC CAATGCCTTT TTAGATGTAT TTTTGATGTA TATATCTATT 900 
ATTCAAAAAA TCATGTTTAT TTTGAGTCCT AGGACTTAAA ATTAGTCTTT TGTAATATCA 960 

AGCAGGACCC TAAGATGAAG CTGAGCTTTT GATGCCAGGT GCAATCTACT GGAAATGTAG 1020 

CACTTACGTA AAACATTTGT TTCCCCCACA GTTTTAATAA GAACAGATCA GGAATTCTAA 1080 

ATAAATTTCC CAGTTAAAGA TTATTGTGAC TTCACTGTAT ATAAACATAT TTTTATACTT 1140 

TATTGAAAGG GGACACCTGT ACATTCTTCC ATCRTCACTG TAAAGACAAA TAAATGATTA 1200 

TATTCACAGA CTGATTGGAA TTCTTTCTGT TGAAAAGCAC ACACAATAAA GAACCCCTCG 1260 

TTAGCCTTCC TCTGATTTAC ATTCAACTCT GATCCCGGGG CCTTAGGTTT GACATGGGAG 1320 

GTGGGAGGAA GATAGCGCAT ATATTTGCAG TATGAACTAT TGCCTCTGQG ACGTTGTGAG 1380 

GAATTGTGCT TTCACCAGAA TTTCTAAGGA TTTCTGGCTT AAATATCACC TAGCCTGTGG 1440 

TAATTTTTTT TCCCT 1455 
(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1086 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

CGCCTGCAGT ACCGGTCCGG AATTCCCGGG TCGACCCACG CGTCGCTGAC CCAGGAGAAG 60 


CTGCCTGTCT ACATCAGCCT GGGCTGCAGC GCGCTGCCGC CGCGGGGCCG GCAGCTGAAC 120 

TATGTGCTCT TCAQGGCGGG CACCGTGTTG CATTCATCTT TGTACCCCCA GCATCTAGCA 180 

GTGTTGGCAT GTAGTAGGCA CTCAAGAAAT GTGTC?rTGAA TC3AACGATGC CTGTGACAAG 240

CAAGCGGACT TTATTCTTTC CTGACCCTTG CTCCTATGAC ACACCTCCTC CTGACTGCCA 300 

CTGTCACTCC TTCAGAGCAG AACTCCTCTA GGGAACCTGG ATGGGAAACA GCCATGGCCA 360 


AGGACATCCT GGGTGAAGCA GGGCTACACT TTGATGAACT GAACAAGCTG AGGGTGTTGG 420 

ACCCAGAGGT TACCCAGCAG ACCATAGAGC TGAAGGAAGA GTGCAAAGAC TTTGTGGACA 480 

AAATTGGCCA GTTTCAGAAA ATAGTTGGTG GTTTAATrGA GCTTGTTGAT CAACTTGCAA 540

AAGAAGCAGA AAATGAAAAG ATGAAGGCCA TCGGTGCTCG GAACTTGCTC AAATCTATAG 600 

CAAAGCAGAG AGAAGCTCAA CAGCAGCAAC TTCAAGCCCT AATAGCAGAA AAGAAAATGC 660 

AGCTAGAAAG GTATCGGGTT GAATATGAAG CTTTGTGTAA AGTAGAAGCA GAACAAAATG 720 






AATTTATTGA CCAATTTATT TTTCAGAAAT GAACTGAAAA TTTCGCTTTT ATAGTAGGAA 780 

GGCAAAACAA AAAAAAGCCT CTCAAAACCA AAAAAACCTC TGTAGCATTC CAGCGGCTTG 840

ACCAATGACC TATGTCACAA GAGGTGGCGT GTAAGGAATG CAGCCCCCTG AAGACAGCAC 900 

TACAAGTCTG GGGGAGCCAG TTTTAACATC AGTGCACAGC TGCTGCTGGT GGCCCTGCAG 960 


TGTACGTTCT CACCTCTTAT GCTTAGTTGG AACTAAGCAG TTTGTAAACT TTCATCCTTT 1020 

TmTGTAAA TTCACAAAGC TTTGGAAGGA GARGCAATAA AnTTTGKTT TCNAAATGGC 1080 

TTGATG 1086



(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

GGCACGGGAG CAGCCGGGCT GGTCCTGCTG CGAGCCGGCG GCCCGGAGTG GGGCGGCGGA 60

GCPAACATGA ACGTTGGAGT TGCCCACAGT GAAGTGAATC CAAATACCCG TGTCATGAAC 120 

AGCCGGGGTA TGTGGCTGAC ATATGCATTG GGAGTTGGCT TGCTTCATAT TGTCTTACTC 180 


AGCATTCCCT TCTTCAGTGT TCCTGTTGCT TGGACTTTAA CAAATATTAT ACATAATCTG 240 

GGGATGTACG TATTTTTGCA TGCAGTGAAA GGAACACCTT TCGAAACTCC TGACCAGGGT 300 

AAAAGCAAGG CTCCTAACTC ATTGGGAACA ACTGGACTAT GGAGTACAGT TTACATCTTC 360

ACGGAAGTTT TTCACAATTT CTCCAATAAT TCTATATTTT CTGGCAAGTT TCTATACGAA 420 

GTATGATCCA ACTCACTTCA TCCTAAACAC AGCTTCTCTC CTGAGTGTAC TAATTCCCAA 480 


AATGCCACAA CTACATGGTG TTCGGATCTT TGGAATTAAT AAGTATTGAA ATGTTTTGAA 540 

ACTGAAAAAA AATTTTACAG CTACTGAATT TCTTATAAGG AAGGAGTGGT TAGTAAACTG 600 

CACTGTTTCT CTGATAATGT GAAATGAGAA GTATTTACAT TGGAGGGCCA ATGGCTGGTC 660

CTTCAAGTGC TGTTTTGAAG TGCAGATTTC CATTAAATGA TGCCTCTGTT TAATACACCT 720 

GGTACATTTC TGAAGAGGGG CTTTATAAGC AGGCTGGGCA GQCCCAGCTT ATAAGTTAAA 780 


GGGCATCACA GTGAGGGTGT AGTAGATAAA TTCAAGGAAA TAAGAGATTT GTAAGAAACT 840 
AGGACCAGCT TAACTTATAA TGAATGGGCA 1TGTGTTAAG AAAAGAACAT TTCCAGTCAT 900 
TCAGCTGTGG TTATTTAAAG CAGACTTACA TGTAAACCGG AATCCTCTCT ATACAAGTTT 960
ATTAAAGATT ATTTTTATTA CCGTAAAAAA AAAAAAAAAA AAA 1003






(2) INFORMATION FOR SEQ ID NO: 269: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1234 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

ATCAGCATCT ACAAGTAGCA TATTTTGGAT GGTGTTTGTG TGCTACTTCA AAGTAACTAG 60 

GAAAAAATAA TCCTCGCAAC ACAGGTACCT TGTCATGTCA GAATTGGGGG TGTTAGGTTG 120 


CCAGTTGTAT CAGTGTTGAT TCATTTCATT ACTTCCTACA GAGCAAACAT GAACGTTGGA 180 

GTTGCCCACA GTGAAGTGAA TCCAAATACC CGTGTCATGA ACAGCCGGGG TATGTGGCTG 240 

ACATATGCAT TGGGAGTTGG CTTGCTTCAT ATTGTCTTAC TCAGCATTCC CTTCTTCAGT 300
GTTCCTGTTG CTTGGACTTT AACAAATATT ATACATAATC TCSGGGATGTA CGTATTTTTG 360
CATGCAGTGA AAGGAACACC TTTCGAAACT CCTGACCAGG GTAAAGCAAG GCTCCTAACT 420
CATTGGGAAC AACTGGACTA TGGAGTACAG TTTACATCTT CACGGAAGTT TTTCACAATT 480
TCTCCAATAA TTCTATATTT TCTGGCAAGT TTCTATACGA AGTATGATCC AACTCACTTC 540
ATCCTAAACA CAGCTTCTCT CCTGAGTGTA CTAATTCCCA AAATGCCACA ACTACATGGT 600
GTTCGGATCT TTGGAATTAA TAAGTATTGA AATGTTTTGA AACTGAAAAA AAATTTTACA 660
GCTACTGAAT TTCTTATAAG GAAGGAGTGG TTAGTAAACT GCACTGTTTC TSTGATAATG 720
TGAAATGAGA AGTATTTACA TTGGAGGGCC AATGGCTGGT CCTTCAAGTG CTGTTTTGAA 780
GTGCAGATTT CCATTAAATG ATGCCTCTGT TTAATACACC TGGTACATTT CTGAAGAGGG 840
GCTTTATAAG CARGCTGGGC AQGCCCAGCT TATAAGTTAA AGGGCATCAC AGTGAGGGTG 900
TAGTAGATAA ATTCAAGGAA ATAAGAGATT TGTAAGAAAC TAGGACCAGC TTAACTTATA 960
ATGAATGGGC ATTGTGTTAA GAAAAGAACA TTTCCAGTCA TTCAGCTGTG GTTATTTAAA 1020
GCAGACTTAC ATGTAAACCG GAATCCTCTC TATACAAGTT TATTAAAGAT TATTTTTATT 1080
ACCRTACATA TTTCKCTTGT TTTATGTAAG YGGATGTATA TCCTCTTGTT TTATACAAGC 1140
CAGTHXCAC TTATGAGGGT ACTTTTTTGG TTTTGCTGGG CTTAATATTG TGTATTGGTC 1200
AATGAGGCCA TTTTTACANT TATTAACGTT ACAG 1234

(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:
NGAGGTGCGT TCTGAGCCGT CTGTCCTGCG CCAAGATGCT TCAAAGTATT ATTAAAAACA 60
TATGGATCCC CATGAAGCCC TACTACACCA AAGTTTACCA QGAGATTTGG ATAGGAATGG 120
GGCTGATGGG CTTCATCGTT TATAAAATCC GGGCTGCTGA TAAAAGAAGT AAGGCTTTGA 180
AAGCTTCAGC GCCTGCTCCT GGTCATCACT AACCAGATTT ACTTGGAGTA CATGTGAAAG 240
AAAACGTCAG TCTGCCTGTA AATTTCAGCA AGCCGTGTTA GATGGGGAGC GTGGAACGTC 300
ACTGTACACT TGTATAAGTA CCGTTTACTT CATGGCATGA ATAAATGGAT CTGTGAGATG 360
CACTGCTACC TGGTACTGCT TTCAGTGTGT TCCCCCTCAG CCCTCCGGCG TGTCAGGCAT 420
ACTCTGAGTA GATAATTTGT CATGCAGCGC ATGCAATCAG AATCTCACTG AGCCACCCAT 480
CATTGTGAAA TAATTACCTC AGTTGTACAG GACTTGGTGA TCAGGATCCA GGCACTCACT 540 
TCTATTCTAC TGCTCAATAA ACGTTTATTA AACT 574



(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1731 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

GCTGCAAGGT GCGCCTCGTG CCGCTGCAGA TCCAGCTCAC TACCCTGGGA AATCTTACAC 60

CTTCAAGCAC TGTGTTTTTC TGCTGTGATA TGCAGGAAAG GTTCAGACCA GCCATCAAGT 120 

ATTTTGGGGA TATTATTAGC GTGGGACAGA GATTGTTGCA AGGGGCCCGG ATTTTAGGAA 180 


TTCCTGTTAT TGTAACAGAA CAATACCCTA AAGGTCTTGG GAGCACGGTT CAAGAAATTG 240 

ATTTAACAGG TGTAAAACTG GTACTTCCAA AGACCAAGTT TTCAATGGTA TTACCAGAAG 300 

TAGAAGCGGC ATTAGCAGAG ATTCCCGGAG TCAGGAGTGT TGTATTATTT GGAGTAGAAA 360

CTCATGTGTG CATCCAACAA ACTGCCCTGG AGCTAGTTGG CCGAGGAGTC GAGGTTCACA 420 

TTGTTGCTGA TGCCACCTCA TCAAGAAGCA TGATGGACAG GATGTTTGCC CTCGAGCGTC 480 

TCGCTCRARC CNGGGATCAT AGTGACCACG AGTGNAGGCT GTTCTGCTTC AGCTGGTAGC 540 
TGATAAGGAC CATCCT^AAAT TCAAGGAAAT TCAGAATCTA ATTAAGGCGA GTGCTCCAGA 600

GTCGGGTCTG CTTTCCAAAG TATAGGACAT 1TGAAGAACT GGTATGCTAC TCACTGGTGA 660

AGGACAGTCA GGTGAAGGAC TGTAAGCCCA CACAAGCTCT TCTTATCTCT ACTAGAATTA 720 

AAATGTTAAG TCAAAAACGG CTCCTTTTTT GCGCCTCCTA GTGAACTTAA CCAGCTAGAC 780 


CATTTGAGTA CCAGCATTTA GTTACAAACG TCAAAGGCTT CCGQTGCTGC TTACCTTCCT 840 

TTTTTGTTAA TGTGCTTTTA TTTATTAAAA AAAATTACAA TGAAGATGCC TGTrTTGTCT 900 

CTACTGTGTA CTCTGATCGT ATCTTTCCAA AGTGCAGACT CTTGTGAAGT TTTCTTAAAT 960

TGTTCACTTT AAAGAAAATG ACGTACCAAC AATGATTTGG CmTATATT ACTGTAAGAT 1020 

GTTATAATGT TAATGTGGAT GTAGTGCTTT TACTTTACAG ATTGATTGGA ATAAGATTAT 1080 


TGCATATGAA TTTACCCACA GGACTCTGAA TCATGTTACC CACTCCCCTC ACAATGTTGT 1140 

CCACTTAGTG AGTTGCATTG ATCTATCCGT ACCAAATGAT GTTGAATAAT TACATATCTT 1200 

TCTKGACTAT ACTGATTTCT TATTTTGGTC ACTATTACTA AATCTCTGTT AATTOTCTCT 1260
CTTTTAACTG AAAAGGGATG GGATAGAAGG GTTTGCAATG CCATATTATT GGTGGAGGGC 1320
TGTTTTAACA TCTTTGAAGT ATGGCTTGCT GAATATCTTT ACCAACATCT TGAATATATA 1380
TTCTAGTGTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG GTGGAATATC ATTTTAAAAT 1440
GTTCATGTTC TGTTCTATAT TTTCTTCACC TACTCTCCAA ATATTGTAAT GCAAAAAGTC 1500
TCAGTAATGA TTTGGTAGTA TTAATTTTGT GGTCATTGTT [illegible] AAATTTATIT 1560
TCATTAAATA CTTRTTAGAG GGTTTTGAAA TGTTTTTCAA ATATCTGAAA TGTGAAACTG 1620
CTGTCTTTTA TATTAAAGTA ATTAAAGAAA ATGTATTGTG ATTGAAATTA TTTTGNCCTC 1680
CACAAGATGG CTCTATGAGT ATTCTTCCAG GGATTCTAAT ATTTATTTAA G 1731
(2) INFORMATION FOR SEQ ID NO: 272:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1320 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

CTGCTTAGGA AGAGAAGGTC AGAGTTCGCG GGGGCAGAGG CATTCTTGCC GCTGGCCCAG 60 

TCACTATGTA GTGGAGGGGC AGACACCCTC CCGCAAATTC TGGAAGGTTC TTAGTCTCGA 120 

CTAGGGCAGT AGCCCAGGAC TCCTAGTCGC CGGCTTCAGG TCACTGCCGG CTGAACGGAG 180 

cSrcCCGTCGC CATGTTTGGC TGCTTGGTGG CGGGGAGGCT GGTGCAAACA GCTCCACM 240

AAGTGGCAGA GGATAAATTT GTTTTTGACT TACCTGATTA TGAAAGTATC AACCATGTTG 300 

TGGTTTTTAT GCTGGGAACA ATCCCATTTC CTGAGGGAAT GQGAGGATCT GTCTACTTTT 360 

CTTATCCTGA TTCAAATGGA ATGCCAGTAT GGMAACTCCT AGGATTTGTC ACGAATGQGA 420 

AGCCAAGTGC CATCTTCAAA ATTTCAGGTC TTAAATCTGG AGAAGGAAGC CAACATCCTT 480 

TTGGAGCCAT GAATATTGTC CGAACTCCAT CTGTTGCTCA GATTGGAATT TCAGTGGAAT 540 

TATTAGACAG TATGGCTCAG CAGACTCCTG TAGGTAATGC TGCTGTATCC TCAGTTGACT 600 

CATTCACTCA GTTCACACAA AAGATGTTGG ACAATTTCTA CAATTTTGCT TCATCATTTG 660 

CTGTCTCTCA GGCCCAGATG ACACCAAGCC CATCTGAAAT GTTCATTCCG GCAAATGTGG 720 

TTCTGCAAAT GGTATGAGGC ATNTTCTGTC TCCAATATTA AGGCTTTTTA TAACTGAATA 780 

TCTATTTTGT CTATGAATAT ATTCCTTTTT TGACATTTAA ACATATTCTT TTATTGTGAA 840 

CATCAGCACT GCATGCCATT AAAGTATGTA CTATAGAGAT CTGATGAGAA ACAGTTCTTA 900
CCCTAAATAT TTTGTTATAT TGTCGCCATT ATGAATTTAT AAAGACAGGA AAATATAGTT 960

GCCTATGTTT TAGGGACCAC TATTAAAGCT TATAAATATT TGTGTATTTT CATTTAGAAG 1020 

TACCATCTAT GAGAGTAGTT TATACTGCAC TGTGTACATG AATGGCTAAT GAATCTATTT 1080

TCCAACTTTC CCX3TGTTTTA TAGATATTTC TTTTGACTTT GAGTATCCTA GAGATGGGAG 1140 

GATGCCTAGG AAGAGTTTGT TGAGAAGTGG TACCATGGTO TAGCATGGGA GAGCATTGGG 1200 

AATGCACTAG GTTTGAATTT GGCATAATGG TAGCTATGTG ACCCTGAGCA AATTTCTCTC 1260 

ATCTGCTCAT CTGANGAATG AQGAAATAGG AGTGAATTTG ATOTTTCCTA GGTCCNTCTA 1320 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
CCCTGGAGAG GGGCTGCTGT GCCAGCTTGG GGAGGGTCTG GGATGGGGCT GCCCCTGATG 60 
GCCCTGATGT GGAGTACCTT GCCAGCATCT GCTQGGGTGA ACTTTATTTT AGCCCTTCCC 120
TTGTTGYTCT TATGAAGAAC AGAGGAQGGG TGGGCAGGTC AGTGATGTCA GCAGTGAGTA 180 
TTCCCAGCAC AGCGGCTCTG GAAGAGGCAT GAGGCATTTC TTTCAGGAAA TGRTCATTAT 240 
TCAGCCAGAA GGCATTCATT AAGTAAGTCC TGACTTTGTG CCCAGCTCTG TGTTATAGGC 300 






CCTTGGCGAG ACTCAGGAGG GGCARAGGAC GCTAGKTTKT AGWTAACACG GAACCTCARA 360 

GGOTATATGG TCCAAGAAGA CCCGGGGGCG GTGAAAACCC TGTGGACTAA TGCTCACGGG 420

AGCCCGAGGT CACACTTTGA CTTTGCTACC ATGGGCTGTG TCTANO^CG TATATATGCT 480 

GCGTAATTAT TACAGAGGCA GTCCATGTGC ATTGT 515 




(2) INFORMATION FOR SEQ ID NO: 274: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
TGACACCCAT AAGGAATTCA TGAAGAAAGT AGAAGAAAAG CGAGTGGACG TTAACTCAGC 60 
AGTAGCCATG GGAGAAGTCA TCCTGGCTGT CTGCCACCCC GATTGCATCA CAACCATCAA 120
ACACTGGATC ACCATCATCC GAGCTCGCTT CX3AGGAGGTC CTGACATGGG CTAAGCAGCA 180
CCAGCAGCGT CTTGAAACGG CCTTGTCAGA ACTGGTGGCT AATGCTGAGC TCCTGGAAGA 240
ACTTCTGGCA TGGATCCAGT GGGCTGAGAC CACCCTCATT CAGCGGGATC AGGAGCCAAT 300
CCCGCAGAAC ATTGACCGAG TTAAAGCCCT TATCGCTGAG CATCAGACAT TTATGGAGGA 360
GATGACTCGC AAACAGCXTTG ACX5TGGACCG CGTCACCAP>G ACATACAAAA GGAAAAACAT 420
AGAGCCTACT CACGCGOCTT TCATAGAGAA ATCCCGCAGC GGAGGCAGGA AATCCCTAAG 480
TCAGCCAACC CCTCCTCCXrA TGCCAATCCT TTCACAGTCT GAAGCAAAAA ACCCACGGAT 540
CAACCAGCTT TCTGCCCGCT GGCAGCAGGT GTGGCTGTTA GCACTGGAGC GGCAAAGGAA 600
ACTGAATGAT GCCTTGGATC GGCTGGAGGA GTTGAAAGAA TTTGCCAACT TTGACTTTGA 660
TGTCTGGAGG AAAAAGTATA TGCGTTGGAT GAATCACAAA AAGTCTCGAG TGATGGATTT 720
CTTCCGGCX3C ATTGATAAGG ACCAGGATGG GAAGATAACA CX3TCAGGAGT TTATCGATGG 780
CATTTTAGCA TCCAAGTTCC CCACCACCAA GTTAGAGATG A(nX5CTGTGG CTGACATITr 840
CGACCGAGAT GGGGATQGTT ACATTGATTA TTATGAATTT GTGGCTGCTC TTCATCCXIAA 900
CAAGGATGCG TATCGACXAA CAACCGATGC AGATAAAATC GAAGATGAGG TTACAAGACA 960
AGTGGCTCAG TGCAAATGTG CAAAAAGGTT TCAGGTGGAG CAGATCGGAG AGAATAAATA 1020
CCGGTTCTTC CTCGGCAATC AGTTTGGGGA TTCTCAGCAG TTGCGGCTGG TCCGTATTCT 1080
GCGCAACCGT GATGGTTCGC GTTGGTGGAG GATGGATGGC CTTGGATGAA TTTTTAGTGA 1140
AAAATGATCC CTGCCGAQCA CGAGGTAGAA CTAACA1TOA ACTTAGAGAG AAATTCATCC 1200
TACCAGAGGG AGCATCXCAG GGAATGACCC CCTTCCGCTC ACGGGGTCGA AGGTCCAAAC 1260
CATCTTCCCX5 GGCAGCTTCC CCTACOX^GTT CXIAGCTCCAG TGCTAGTCAG AGTAACCACA 1320
GCTGTACATC CATGCCATCT TCTCCAGCCA CCCCAGCCAG TGGAACCAAG GTTATCCCAT 1380
CATCAGGTAG CAAGTTGAAA CGACX:AACAC CAACTTTTCA TTCTAGTCGG ACATCCCTTG 1440
CTGGTGATAC CAGCAATTAG TTCTTCCCCG GCCTCCACAG GTGCCAAAAC TAATCGQGCA 1500
GACCCTAAAA AGTCTGCCAG TCGCCCTGGG AGTCGGGCTG GGAGICGAGC CGGGAGTCGA 1560
GCCAGCAGCC GGCGAGGAAG TGACGCTTCT GACTTTGACC TCTTAGAGAC GCATTGCTTG 1620
TTCCX5ACACT TCAGAAAGCA GCXXTTGCAGG GGGCCAAGGC AACTCCAGGA GAGGGCTAAA 1680
CAAACXTTTCC AAAATCCCAA CCATGTCTAA GAAGACCACC ACTGCCTCCX: CCAGGACTCC 1740
AGGTCCCAAG CGATAACACT GTCTAAGCAC CCCCAAGCCA CTATCCACTT TGAATCCTGC 1800
TCCATACATT GGGTGTATAT TTATTCTGAA CGGGAGAAGT TATATTGTTA AAAGTGTAAA 1860
AGAATAATTG TGTTATGAAG CTGCCTTATT TTTTTTCTTT TTGTAAGTTA CTATTTTCAT 1920

(3TGAATATTT ATGTAGATAA AATTTGCCTC CTGGTAACXrC TCTAATGGAT GGGGCX!CAGA 1980 

AATGAAATAT TTGAGAAAAA CAAGTGAAAA GGTCAAGATA CAAATGTGTA TTAAAAAAAA 2040 

AAAAGCCTAT TAATAGGGTT TCTGCX3CX3GT GCAGGGTTGT AAACCTGCTT TATCmTAG 2100 

GATTATTCCT AAATGCATCT TCTTTATAAA CTTGACTTGC TATCTCAGCA AGATAAATTA 2160 

TATTAAAAAA ATAAGAATCC TGCAGTGTTT AAGGAACTCT TTTTTTGTAA ATCACGGACA 2220 

CCTCAATTAG CAAGAACTGA GGGGAGGGCT TTTTCCATTG TTTAATGTTT TGTGATTTTT 2280 

AGCTAAAGAG AGGGAACCTC ATCTAAGTAA CATTTGCACA TGGATACAGC AAAAGGAGTT 2340 

CATTGCAATA CTC5TCTTTGG ATATTGTTTC AGTACTGGGT GTTTAAAGGA CAAATAGCTG 2400 

CTAGAATTCA GGGGTAAATG TAAGTGTTCA GAAAACGTCA GAACATTTGG GGmTAAAC 2460 

TGATTTGTTG CTCCXTTATCC AGCCTAGACA CCAGTAACTC TTGTGTTCAC CAGGACCCAG 2520 

ACCCTTGGCA AGGGATAGGC TCGTTGGTGA CATTGTGAAT TTCAGATTTG TTTTATCCAC 2580 

TTTTTTTGCT ATTTATTTAA ATGGTCGATC AACTTCCCAC AAACTGAGGA ATGAATTCCA 2640 

CGAGCCOXSTT CTGAAAATGT GGACGTAAGA CAAACACX5TG CTCGTCCTTT AATGGAGTTC 2700 

ACCAGCACAC TTGTTAACCA GTCCTGTTTG CTTTCGTCTT TTTTTGTGCG TAATAAAGTC 2760 

AACTGACCAA GTGACCATGA AAAGGGGCTG TCTGGQGCTC CTGTTTTTTA GCTGCTGTTC 2820 

TTCAGCTCCX3 ACCATGOTGC TGTGTGATTA TCTCAATTGG TTTTAATTGA GGCAGAAACT 2880 

GAAGCTCTAC CAATGAACTG TTTAGAAACA AGACACACTT TTGTATTAAA ATTGCTTGCA 2940 

GTAAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG AGGGGOeseee 2995
(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
GGGACCCGCG CGSCTCCCGG GGATGGTGAG CAAGGCGCTG CTGCNWCGTG TCTGCCGTCA 60
ACCGCAGAGG ATGAAGCTGC TGCTGGGCAT CGCCTTGCTG GCCTACGTCG CCTCTGTTTG 120
GGGCAACTTC GTTAATATGA GGTCTATCCA GGAAAATGGT GAACTAAAAA TTGAAAGCAA 180
GATTGAAGAG ATGGTTGAAC CACTAAGAGA GAAAATCAGA GATTTAGAAA AAAGCTTTAC 240
CCAGAAATAC CCACCAGTAA AGTTTTTATC AGAAAAGGAT OGGAAA7VGAA TTTTGAWTAA 300
CAGGAGGCGC AGKGTTCGTG GGCTCCCATC TKAACTGACA AACTCATGAT GGACGGCXZAC 360

GAGGTGACCG TGGTGGACAA TTTCTTCACX3 GGCAGGAAGA GAAACGTGGA GCACTGGATC 420 

GGACATGAGA ACTTCGAGTT GATTAACCAC GAaSTGTGGG AGCCCCTCTA CATCGAGGTT 480 

GACCAGATAT ACCATCTGGC ATCTCCAGCC TCCCCTCCAA ACTACATGTA TAATCCTATC 540 

AAGACATTAA AGACCAATAC GATTGGGACA TTAAACATGT TGGGGCTGGC AAAACGAGTC 600 

GGTGCCCGTC TGCTCCTGGC CTCCACATCG GAGGTGTATG GAGATCCTGA AGTCCACCCT 660 

CAAAGTGAGG ATTACTGGGG CCACGTGAAT CXAATAGGAC CTCGGGCCTG CTACGATGAA 720 

GGCT^AACGTG TTGCAGAGAC CATGTGCTAT GCCTACATGA AGCAGGAAQG CGTGGAAGTG 780 

CGAGTGGCCA GAATCTTCAA CACCTTTGGG CCACX3CATGC ACATGAACGA TGGGCGAGTA 840 

GTCAGCAACT TCATCCTGCA GGCGCTCCAG GGGGAGCCAC TCACX3GTATA CX3GATCOQGG 900 

TCTCAGACAA GGGCGTTCCA GTACGTCAGC GATCTAGTGA ATGGCXZTCGT GGCTCTCATG 960 

AACAGCAACG TCAGCAGCCX: GGTCAACCTG GGGAACCXIAG AAGAACACAC AATCCTAGAA 1020 

TTTGCTCAGT TAATTAAAAA CCTTGTTGGT AGCGGAAGTG AAATTCAGTT TCTCTCCGAA 1080 

GCCCAGGATG ACfCCACAGAA AAGAAAACCA GACATCAAAA AAGCAAAGCT GATGCTGGGG 1140 

TGGGAGCCCG TGGTCCCGCT GGAGGAAGGT TTAAACAAAG CAATTCACTA CTTCCX5TAAA 1200 

GAACTCGAGT ACCAGGCAAA TAATCAGTAC ATCCCCAAAC CAAAGCCTGC CAGAATAAAG 1260 

AAAGGACGGA CTCGCCACAG CTGAACTCCT CACTTTTAGG ACACAAGACT ACCATTGTAC 1320 

ACTTGATGGG ATGTATTTTT GGCTTTTTTT TGTTGTCGTT TAAAGAAAGA CTTTAACAGG 1380 



TCTCATGAAG AACAAACTGG AATTTCATTC TGAAGCTTGC TTTAATGAAA TGGATGTGCC 1440 

TAAAAGCTCC CCTCAAAAAA CTGCAGATTT TGCCTTGCAC TTTTTGAATC TCTCTTTTTA 1500 

TGTAAAATAG CGTAGATGCA TCTCTGCX3TA TTTTCAAGTT TTTTTATCTT GCTGTGAGAG 1560 

CATATGTTGT GACTGTCGTT GACAGTTTTA TTTACTGGTT TCTTTGTGAA GCTGAAAAGG 1620 

AACATTAAGC QGGACAAAAA ATGCCGATTT TATTTATAAA AGTGGGTACT TAATAAATGA 1680 

GTCGTTATAC TATGCATAAA GAAAAAYCCT AGCAGTATTG TCAGGTGGTG GTGCX3CCGGC 1740 

ATTGATTTTA GGGCAGATAA AAGAATTCTG TGTGAGAGCT TTATGTTTCT CTTTTAATTC 1800 

AGAGTTTTTC CAAGGTCTAC TTTTGAGTTG CAAACTTGAC TTTGAAATAT TCCTGTTGGT 1860 

CATGATCAAG GATATTTGAA ATCACTACTG TGTTTTGCTG CGTATCTGGG GCGGGGGCAG 1920 

GTTGGGGGGC ACAAAGTTAA CATATTCTTG GTTAACCATG GTTAAATATG CTATTTTAAT 1980 

AAAATATTGA 1990 
(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

AACTTCGCTT AGCTCTCCAG GGTNAAACGG GTGAC»ICCTT AAAAACAGAA GAGAACAAGA 60 

TTTAAAGTCC GTTGCATTGA AAATAACAAA CAATATCAAT GTTTTAATCA AGGATCTCTT 120 

CCACATTCCT CCTTCTTATA AGAGCACAGT AACACTATCC TGGAAACCTG TACAAAAGGT 180 

TGAGATTGGG CAAAAGAGAG CCAGTGAAGA TACAACTTCA GGTTCACCAC CCAAGAAATC 240 

TTCAGCAGGA CCAAAAAGAG ATGCCAGGCA GATTTATAAC CCTCCCAGTG GGAAATATAG 300 

CAGCAATTTG GGCAACTTTA ATTATGAGCA GAGAGGAGCC TTCAGGGGAA GTAGAGGTGG 360 

CCGAGGTTGG GGCACACGAG GAAATCGTAG TCGGGGAAGA CTCTACTGAA TAAGACATCA 420 

GCATTCTTCA GCATTGTCAT GAGCTTAATA TACTTAAATT CTACTACTCA TTGGATTGCC 480 

GGGGATGTCC CTTTAAACAG ACTGCTGCCT TCAGCTAAAA ACTTAATGTT CTTTATACCT 540 

TTGTATGTAT GACCTACTTT TGTAACAGAC CATQGTTGTG TCCAAGGTAA AACCACAGTG 600 

ATATTTTTGG ATGCTTTGTC TGCAATCTTG ACTTGTTTTT GCAGTATCAT TATTCAGACT 660 

TCAAATTGTG AATCTTTTAA ACATCTTGAT AATTTGTTGT TGAGAGCTGT TCATTCTAAA 720

ATGTAATGAA ATTCAGTCTA GTTCTGCTGA TAGGGATCAT CAGTTTTGAA AGGTTACTGA 780

TTTTCCTCTT CCCTCTTAGT TTTTTACCCA ATATATGGAG AAGAGTAATG GTCAATCTTA 840 

ACATTTTGTT TTAATTGTTT AATAAAGCTG CTGGGCAGTG GTGCAGCATT CCTACCTAGT 900 

GTCATAAAAG CAAAATACTT ACATAGCTTT CTTAAAATAT AGGAATGACA TTACATTTTT 960 

AGGAGAAAGT AAGTTGCTTT GCACCGCCTA CTTAATTCTT TTCCATATAT TGTGATACAA 1020 

ACTTTTGAAT ATGGAATCTT ACTATTTGAA TAGAAATGTG TATGTATAAT ATACATACAT 1080 

ACATAAGCAT ATATGTGTGT GTGTGTGTGT ATATATATAT ATATGCATGC TGTGAAACTT 1140 

GACTACACAA CATAAATCAC TTTTTAAATT CCAGGAACGG GTAGTrCTGAC ACGGTGATTA 1200 

TCCTTTTGAG GCTGAATCCG TTATTAACTT GTTATTTAGG TTTTTACTCC CAGTAGCAAG 1260 

GGATTCTAAG TTAGTTGCAC TTACATGATT ATTGTTATTT AAAACTAAGA ATAAAGGCTG 1320 

CATTTTCAAA GATAAATTGG AATTGCTGTT GGTGAAATAA CAACCAAAAT ACTGAATCTG 1380 

ATCTACATAC AGGnTCTAC AGGAAGAGAT GGTATAATTT ACAATTTGGA GATTTAATAA 1440 
CCAGGGCTAC CCAGAAAAAG TGACTTGATA ACATGGTACC AATAAGTAAG GGATGCTCTC 1500
TCGGTTTGCT TTTGCCACTT TCAAGATTTT AACTTCTCAG GTTATTAATC AAAATTATTG 1560
TATAAC3TTAG CCAATAGAAT TTTTAGGTTA AAACAACAGA TGGGGGGTTT GTGGAGTGTT 1620
TAATGTCATG GGCATnTTA GTAGCATAGA CX:CTTTGTTC TGCATTTGAA TGTTTCCTAT 1680
ATTTTTGTTT CACAGTTAAT CTTCCCTCCC CAAGTTTGCT ATTCAAATCA ACTGCCTGAA 1740
TGACATTTCT AGTAGTCTGA TGTATTTTTC TGAGGAATAG TTTGTGATTC CAATGCAGGT 1800
GTCTTCATTA CCATTACCTC TACACTGCAG AAGAAGCAAA ACTCCTTTAT TAGAATTACT 1860
GCACATGTGT ATGGGGAAAA TAGTTCTGAA AGGCTAGAAT GATACAAGTG AGCAAAAGTT 1920
GGTCAGCTTG GCTATGGAGT GGTQGCAATA ATCTCTAAAC ATTCCAAAAG ACCATGAGCT 1980
GAACCTAAAC TCCXnTGGAA TCTGAACAAA GGAATATAAA ATTGCCATTT GAAAACTC5AC 2040
CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCCATTT CAAGTACAGA 2100
AATTATAGAG ACTACAGCTA AATAAATTTG AACATTAAAT ATAATTTTAC CACTrTTTGT 2160
CTTTATAAGC ATATTTGTAA ACTCAGAACT GAGCAGAAGT GACTTTACTT TCTCAAGTTT 2220
GATACTGAGT TGACTGTTCC CTTATCCCTC ACCCTTCCCC TTCCCTTTCC TAAGGCAATA 2280
GTGCACAACT TAGGTTATTT TTGCTTCCGA ATTTGAATGA T^AAACTTAAT GCCATGGATT 2340
'l"i'ri"iXri'l''l'i' GCAAGACACC TGTTTATCAT CTTGTTTAAA TGTAAATGTC CCCTTATGCT 2400
TTTGAAATAA ATTTCCTTTT GTAATTTTAA AAAAAA 2436



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

GCCACTGACT TCTCCCACCC TTCTCTCTCC CCCATAATAG TTTATTTGGT TGGTCTGGAC 60

TCACTTGTGG CCTTTRATTA AATTCCTAAG GGGCCTGAAG AAGACATTTC TACTGCAGAG 120

GGTTAGAGGC ACTTGAGCAA GGCCCCCACA TCCCAACTCT GGGAGTTGTG GTGGGAGGAG 180

GCACTTCTGG GGGATAGGAC CAGACAAGAT AACAGGAGCT CACATGGNAA GCAGAAGCTG 240

TGACAAGTTT AGTAGTCCCA AAATGGGTTA TATCCCTTCC CCCTTTACAT CAGAATCTTG 300

TGAAATGGGA AAACAACAGA AGGAGGGGAT CAAAGATAGC TGATCTCACA TGCTTCCCAG 360

GCAGQGCARA GGTGGGAGTC AAACCCGGGT GACAGGTGGG TGGAGAGCCC TGTTTGAGGT 420

TGTGGCTGAT CCCTCTCTGG TATTAGTTTT TCCCCTGGGA GCAGGAAGCC CTAGGAAGAG 480

GGGACTGCAG GGTCCCCRGG GGATCTTTCC TCCCTCCCCT GCATGAGGCA GAGGCAAGCT 540 


GCCTGCCAAC CCCCTCCCTC AAGGAATGGC CTTGCCCAGG AATGCCCACC ACACATACCC 600 

TCTTCTTTTT TTCTAGTCAA ACTCTTGTTT ATTCCTTGGC TTGCCTCCCT CCTTCCTCCC 660 

CTCTCAACCT TTACTTCTGA TTTCTATTTC ATGGAATTTG GGATTGAAGT TAAACTACAA 720

CAGTGCCXX:C AACACCAAGT CTTGCAGGAA AAAAATACAA AGAAATTTAA CAAAAAAAAA 780 

AA 782 




(2) INFORMATION FOR SEQ ID NO: 278: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

GAGTTCCGGC TGGAGACCCG TGCTCTGGGC CGGCGCCTTC ACCATGGCCT CGGCAGAGCT 60 


GGACTACACC ATCGAGATCC CGGATCAGCC CTGCTGGAGC CAGAAGAACA GCCCCAGCCC 120 

AGGTCGGAAG GAGGCAGAAA CTCGGCAGCC TGTGGTGATT CTYTTGGGCT GGGGTGQCTG 180 

CAAGGACAAG AACCTTGCCA AGTACAGTGC CATCTACCAC AAAAGGGGCT GCATCGTAAT 240

CCGATACACT GCCCCGTGGC ACATGGTCTT CTTCTCCGAG TCACTGGGTA TCCCTTCAGT 300

TCGTGTTTTG GCCCAGAAGC TGCTCGAGCT GCTCTTTGAT TATGAGATTG AGAAGGAGCC 360 


CCTGCTCTTC CATGTCTTCA GCAACGGTGG CGTCATGCTG TACCGCTACG TGCTGGAGCT 420 

CCTGCAGACC CGTCGCTTCT GCCGCCTGCG TGTGGTGGGC ACCATCTTTG ACAGCGCTCC 480 

TGGTGACAGC AACCTGGTAG GGGCTCTGCG GGCCCTGGCA GCCATCCTQG AGCGCCGGGC 540

CGCCATQCTG CGCCTGTTGC TGCTGGTGGC CTTTGCCCTG GTGGTCGTCC TGTTCCACGT 600 

CCTGCTTGCT CCCATCACAG CCCTCTTCCA CACCCACTTC TATGACAGGC TACAGQACGC 660 


GGGCTCTCGC TGGCCCGAGC TCTACCTCTA YTCGAGGGCT GACGAAGTAG TCCTQGCCAG 720 

AGACATAGAA CGCATGGTGG AGGCACGCCT GGCACGCCGG GTCCTGGCGC GTTCTGTGGA 780 

TTTCGTGTCA TCTGCACACG TCAGCCACCT CCGTGACTAC CCTACTTACT ACACAAGCCT 840

CTGTGTCGAC TTCATGCGCA ACTGCGTCCG CTGCTGAGGC CATTGCTCCA TCTCAMCTCT 900 

GCTCCAGAAA TAAATGCCTG ACAMCTCCCC ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG 960 

(2) INFORMATION FOR SEQ ID NO: 279:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1228 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CGCGCTTTGC AGTTCGGTCT CCTGGTGTAC GGCCAACGCC AAGTAGGGGA TTGCGTTCCC 60 

TCCAGTCGCA GCCCTATCAG ATTTGGATAT GTCCTTCATA TTTGATTGGA TTTACAGTGG 120 

TTTCAGCAGT GTGCTACAGT TTTTAGGATT ATATAAGAAA ACTGGTAAAC TGGTATTTCT 180

TGGATTGGAT AATGCAGGAA AAACAACATT GCTACACATG CTAAAAGATG ACAGACTTCG 240 

ACAACATGTC CCAACATTAC ATCCCACTTC CGAAGAACTG ACCATTGCTG GCATGACGTT 300 


TACAACTTTT GATCTGGGTG GACATGTTCA AGCTCGAAGA GTGTGGAAAA ACTACCTTCC 360 

TGCTATCAAT GGCATTGTAT TTCTGGTGGA TTGTGCAGAC CACGAAAGGC TCTTAGAGTC 420 

AAAAGAAGAA CTTGATTCAC TAATGACAGA TGAAACCATT GCTAATGTGC CTATACTGAT 480

TCTTGGGAAT AAGATCGACA GACCTGAAGC CATCAGTGAA GAQAGGTTGC GAGAGATGTT 540 

TGGTTTATAT GGTCAGACAA CAGGAAAGGG GAGTATATCT CTGAAAGAAC TGAATGCCCG 600 

ACCCTTAGAA GTTTTCATGT GTAGTGTGCT CAAAAGACAA GGTTACGGAG AAGGCTTCCG 660 



CTGGATGGCA CAGTACATTG ATTAACACAA ACTCACATTG GTTCCAGGTC TCAACGTTCA 720 

GGCTTACTCA GAGATTTGAT TGCTCAACAT GCATAACTTG AATTCAATAG ACTTTOXSCTG 780 

GTTATAAAAC AGATGTTTTT TAGATTATTA ATATTAAATC AACTTAATTT GAATGAGAAT 840 

TGAAAACTGA TTCAAGTAAG TTTGAGTATC ACAATGTTAG CTTTCTAATT CCATAAAAGT 900 


ACTTGGTTTT TACAGTTTAT AATCTGACAT CACCCCAGCG CCATTTGTAA AGAGCAACTT 960 

TCCAGCAGTA CATTTGAAGC ACTTTTTAAC AACATGAAAC TATAAACCAT ATTTAAAAGC 1020 

TCATCATGTT AAATTTTTTA TGTACTTTTC TGGAACTAGT TTTTAAATTT TAGATTATAT 1080 

GTCCACCTAT CKTAAGTGTA CAGTTAATAA TTAGCTTATT CAATGATTQC ATGATGCCTT 1140 

ACAGTTTTCA ATAACTTTTT TTCTTATGCA AACGTCATGC AATAAAACAA ACTCTAATGT 1200 

TTGGCAAAAA AAAAAAAAAA AAANTCGA 1228 



(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1327 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 




TCTCGGGTCT CGGGACAGGT GAGCACCCTG ATGAAGGCCA CGGTCCTGAT GCGGCACCTG 60 

GGCGGGTGCA GGAGATCGTG GGCGCCCTCC GCAAGGGCGS CGGAGACCGG TTACAGGTGA 120 

TTTCTCATTT TRACATGACC TTGAGCAGGT TTGCATATAA TGGAAAGCGA TGCCCTTCTT 180 

CTTACAATAT TCTGGATAAT AGCAAGATCA TCAGTGAGGA GTGTCGGAAA GAGCTCACAG 240 

CGCTCCTTCA CCACTATTAC CCAATTGAGA TCGACCCACA CCGGACCGTC AAGGAGAAGC 300 


TACCTCATAT GGTGGAATGG TGGACCAAAG CGCACAATCT CCTATGTCAG CAGAAGATTC 360 

AGAAGTTTCA GATAGCCCAG GTGGTTAGAG AGTCCAATGC AATGCTCAGG GAGGGATATA 420 

AGACCTTCTT CAACACACTC TACCATAACA ACATTCCCCT TTTCATCTTT TCTGCGQGCA 480 

TTCGTGATAT CCTGGAAGAA ATTATCCGAC AGATGAAAGT GTTCCACCCC AACATCCACA 540 

TCGTGTCTAA CTACATGGAT TTTAATGAAG ATGGTTTTCT CCAGGGATTT AAGGGCCAGC 600 


TGATACACAC ATACAACAAG AACAGCTCTG TGTGTGAGAA CTGTGGTTAC TTCCAGCAAC 660 

TTGAGGGCAA AACCAATGTC ATCCTGCTGG GAGACTCTAT CGGGGACCTC ACCATGGCCG 720 

ATCGGGTTCC TGGTGTCCAG AACATTCTCA AAATTGGCTT CCTGAATGAC AAGGTGGAGG 780 

-AGCGGCGGGA- NeGCTAACAT -GGACTCeTAT-GACATCGTGC^TSGAGAAGGA-CSAC^ 840 

GATGTGGTCA ACGGGCTACT GCAGCACATC CTGTGCCNAG GGGGTCCAGC TGGAGATGCA 900 


AGGCCCCTGA AGGCGCAGGC TCCNAAGKCC SCTGCAGGCC GTGGTGAGGA GGGQCGCCTC 960 

CCCAGAGTCT GCTCCCCCGT GAACACAGAG CAGAGCCAGG GTGGCCAGCA GTGGCTGGGT 1020 

CCTTCCGCGC CCCTCCGTCC TCCTTTCCCT GAGCACCTTC ATCACCAGAG GCTTGAAGGA 1080 

ACCCCGCCAT GTGGCAQGGC ACAGGCACTG TTCCTGGTGA ACCTTGGACC ACAGCATGTC 1140 

AGTGCTCTAG GGATTGTCTA CTCCAGGGAT TTTCTTCAAA ATTTTTAAAC ATGGGAAGTT 1200 


CAAACAAATA TAATGTGTGA AACAGATCAA AATTTTTAAA ATGAAAAAAA AGCTGCTCTG 1260 

ATTCAGGGGA TGTGGGTCGG GGTAGAACCT GGACCTCITG GCCTGGGGGC ACATGGGATG 1320 

CTTCTAG 1327 



(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

TCACCCTGCC TACAGCGTGG AGCTCAGATG ACTGCGCCCT CCACGGTCAC TGTGAGCAGG 60 

TGGTATTCAC AGCCTGCATG ACCCTCACGG CCAGCCCTGG GGTGTTCCCC GTCACTGTGT 120 

GGCTTTGGCT GAAGCCTAAT TCCACAGCTC CI"lXJl' i ' r fr r GAGAGAGACT GAGAGAACCA 180 


TAATCdTGC CTGCTGAACC CAGCCTGGGC CTGGATGCTC TGTGAATACA TTATCTTGCG 240 

ATGTTGGGTT ATTCCAGCCA AAGACATTTC AAGTGCCTGT AACTGATTTG TACATATTTA 300 

TAAAAATCTA TTCAGAAATT GGTCCAATAA TGCACGTGCT TTGCCCTGGG TACAGCCAGA 360 

GCCCTTCAAC CCCACCTTGG ACTTGiAGGAC CTACCTGATG GGACGTTTCC ACGTGTCTCT 420 

AGAGAAGGAT TCCTGGATCT AGCTGGTCAC GACGATGTTT TCACCAAGGT CACAGGAGCA 480 


TTGCGTCGCT GATGGGGTTG AAGTTTGGTT TQGTTCTTGT TTCAGCCCAA TATGTAGAGA 540 

ACATTTGAAA CAGTCTGCAC CTTTGATACG GTATTGCATT TCCAAAGCCA CCAATCCATT 600 

TTGTGGATTT TATGTGTCTG TGGCTTAATA ATCATAGTAA CAACAATAAT ACCTTTTTCT 660 

CCATTTTGCT TGCAGGAAAC ATACCTTAAG TTTTTTTTGT TrTGTTTTTG TTTTTTTGTT 720 

TTTTGTTTTC CTTTATGAAG AAAAAATAAA ATAGTCACAT TTTTAATACY AAAAAATGGA 780 

CAAAAAAAGT CGAGGGGGG 799 



(2) INFORMATION FOR SEQ ID NO: 282: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

AAAGACTCTA ACATCCATGA GCTTGAACAT GAGCAAGAGC CTACTTGTQC CKSCCAGATG 60 

GCTGAGCCCT TCCGTACCTT CCGAGATGGA TGGGTCTCCT ACTACAACCA GCCTGTGTTT 120 

CTGGCTGGCA TGGGTCTTGC TTTCCTTTAT ATGACTGTCC TGGGCTTTGA CTGCATCACC 180 

ACAGGGTACG CCTACACTCA GGGACTGAGT GGTTCCATCC TCAGTATTTT GATGGGAGCA 240 

TCAGCTATAA CTGGAATAAT GGGAACTGTA GCTTTTACTT GGCTACGTCG AAAATGTGGT 300 

TTGGTTCGGA CAGGTCTGAT CTCAGGATTG GCACAGCTTT CCTGTTTGAT CTTGTGTGTO 360 
ATCTCTGTAT TCATGCCTGG AAGCCCCXHXS GACTTGTCCG TTTCTCCTTT TGAAGATATC 420 
CGATCAAGGT TCATTCAAGG AGAGTCAATT ACACCTACCA AGATACCTGA AATTACAACT 480 
GAAATATACA TGTCTAATGG GTCTAATTCT GCTAATATTG TCCCGGAGAC AAGTCCTGAA 540 
TCTGTGCCCA TAATCTCTGT CAGTCTGCTG TTTGCAGGCG TCATTGCTGC TAGAATCGGT 600 
CTTTGGTCCT TTGATTTAAC TGTGACACAG TTGCTGCAAG AAAATGTAAT TGAATCTGAA 660 
AGAGGCATTA TAAATGGTGT ACAGAACTCC ATG7ACTATC TTCTTGATCT TCTGCATTTC 720 
ATCATGGTCA TCCTGGCTCC AAATCCTGAA GCTTTTGGCT TGCTCGTATT GATTTCAGTC 780 
TCCTTTGTGG CAATGGGCCA CATTATGTAT TTCCGATTTG CCCAAAATAC TCTGGGAAAC 840 
AAGCTCTTTG CTTGCGGTCC TGATGCAAAA GAAGTTAGGA AGGAAAATCA AGCAAATACA 900 
TCTGTTGTTT GAGACAGTTT AACTGTTGCT ATCCTGTTAC TAGATTATAT AGAGCACATG 960 

TGCTTATTTT GTACTGCAGA ATTCCAATAA ATGGCTGGGT GTTTTGCTCT GTTTTTACCA 1020 

CAGCTGTGCC TTGAGAACTA AAAGCTGTTT AGGAAACCTA AGTCAGCAGA AATTAACTGA 1080 

TTAATTTCCC TTATGTTGAG GCATGGAAAA AAAATTGGAA AAGAAAAACT CAGTTTAAAT 1140 

ACGGAGACTA TAATGATAAC ACTGAATTCC CCTATTTCTC ATGAGTAGAT ACAATCOTAC 1200 

GTAAAAGAGT GGTTAGTCAC GTGAATTCAG TTATCATTTG ACAGATTCTT ATCTGTACTA 1260 

GAATTCAGAT ATGTCAGTTT TCTGCAAAAC TCACTCTTGT TCAAGACTAG CTAATTTATT 1320 

TTTTTGCATC TTAGTTATTT TTAAAAACAA ATTCTTCAAG TATGAAGACT AAATTTTGAT 1380 

AACTAATATT JVTGCTTATTG ATCCTATTGA TCTTAAGGTA TTTACATGTA OXSTQGAAAAA 1440 

CAAAACACTT AACTAGAATT CTCTAATAAG GTTTATGGTT TAGCTTAAAG AGCACCTTTG 1500 

TATTTTTATT ATCAGATGGG GCAACATATT GTATGAAGCA TATGTAGCAC TTCACAGCAT 1560 

GGTTATCATG TAAGCTGCAG GTAGAAGCAA AGCTGTAAAG TAGATTTATC ACACAATGAC 1620 

TGCATACAGA CTTCAAATAT GTCAATAGTT TGGTCATAGA ACCTAGAAGC CAAAAGCCAC 1680 

ACAGAAGGGC AAGAATCCCA ATTTAACTCA TGTTATCATC ATTAGTGATC TGTGTTGTAG 1740 

AACATGAGGG TGTAAGCCTT CAGCCTGGCA AGTTACATGT AGAAAGCCCA CACTTGTGAA 1800 

GGTTTTGTTT TACAAATCAC TTGATTTAAC ACACTCAGGT AGAATATTTT TATTTTTACT 1860 

GTTTTATACC CAGAACTTAT TTCTACATTG TTCTACAGCA AGAATATTCA TAAAAGTATC 1920 

CCTTTCAAAT GCCTTTGAGA AGAATAGAAG AAAAAAAGTT TGTATATATT TTAAAAAATT 1980 

GTTTTAAAAG TCAGTTTGCA ACATGTCTGT ACCAAGATGG TACITTGCCT TAACCGTTTA 2040 

TATGCACTTT CATGGAGACT GCAATACGTT GCTATGAGCA CITTCTTTAT CCTTGGAGTT 2100 
TAATCCTTTG CTTCATCTTT CTACAGTATG ACATAATGAT TTGCTATGTT GTAAAATCTT 2160 
TGTAAAAAAT TTCTATATAA AATATTTGAA ACTTAA 2196 
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

GCAGTTAAGG CTTCTGATAA GGAAAGAGAG TCTGAACAGA GCACACACAT CTGGAGCTCC 60 

AGGAGTGGGG GATGCAGCAT CAGATTCCAT CTTGAATTTC TGCTAAAATA CTTTGTACTC 120 

ATAATGGATC TCAACAAAGA TCTGTATTTC ATCTGTGGGT CCATCTTCCC TCTGGGTCAA 180 

GTAGATGTTA AGCTGGACCT TGGCACGCCT CTTAACATGA AGAGATCTAG CTAGACAGAC 240 

AGACTCCCCC ATTTATGGGA ACAAGAATTC AATTTATTCT CTATTTATAA AACATTTTTT 300 

TAAAGTGCCT TGGGTATAAA AATCTAAATG TCTGCGGTGT GATCAGTCAG GAGCACGTAA 360 

CTATCACTCT TCGCATCCTT TGGTCACTGG GAGATCCTTT GGGGGCTGGG AGGTCCTTCT 420 

GTCCCAGGCT AAAGGAAAAG CTTCACAAGG GTAAGAGCCA CAGAACCCTC GGCAAGAAAG 480 

GCCGGTCAGG GAGAATGAAT GGTACAGAGA GGAAAGGAAG GAAAGGGGGT GGAACAGAGG 540 

TAGAAGGCAA GGAAGGGATG CCGCACTGGA GACCGATGGG GACACTCTAA TTGTGCAAGA 600 

GGGAGGATCT TCCTTCTTGA ATGCTGAACA CAGCTAGTCT GAACCTTCCT TGGAAAGTCC 660 

AGCTGTTTGC CCATGCATAG GGCCAACTCT CCCTGCAAAG CAGCAAATGT GGCTTCTATC 720 

AGGAAGGAAA AGTATCCATC AGTGTGACAA GAGGTCACCT TCGAACTTGC ATGAACTCCT 780 

TGCGCAGCCA CAAAGAGTCC TQGTAGAAGT GAGGATCGCC TAGTCTTACG GCTGTCCGTT 840 

TATAGAAGTA GCAGTACAAC ACTGCTGCTA GTCTCTGGAA TACAAACAGC ATTTGAAGTC 900 

CATCTGTCCA TATGAAGCTG TTGGAGTTTT TCCAGCGTAA GTTCATGACC CAGACATGAA 960 

GGGAGATGCT GAGGGCAAAG TACACAGCTG TCAGGATGAT GGTCCCTTTG AACTTATGGA 1020 

ATAGGAGGTT GACCAGGCCA GCCTGGAAGA CGAAGGTGTT GAAGAACATG AGGAAAATGA 1080 

TGATGATGTT GAAGAGGACT GCAATATCCT GGATGCACTG AGGGAQAGGY TTCTAGTTCC 1140 

TTTGAATGAG AGCTGTTTCC CTTGCTCTAA GGCAAGCACC TCCAA 1185 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1634 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 


AGQGAAAGGG GAGGGTAGCG GGAGGGTAGC AGGTGAGTTC CTAGGGCTGG AAGGTTTAGC 60 

AGCAGCCTGG TGCAGTGCCC TGTCATCAAG ACAAACCCAC GGTCCTMCTG GGTGCCTACC 120 

AAGCTTGGTT TGTACAAAAG CAAGGTGGGA GTCTATTTTT GTACATGAGA TACATCACAC 180 

TTACCTGTGG GCCAGTATTG TGAAGTGAGT CTGAGTTGTT TACACTGATG CCTTCCCTGC 240 

CCACCACAAA TTGTGTACAT AGTCTTCAGA TGATACCACC CCTTTCCCCA GCTCCCAACC 300 


AAGAGCTGGT TCTAGGCCTG TGTTATATGT CATATTTAGC STTTTTATAT ATGACCTTTG 360 

A1TTCTGTTG TTTGTATTTT AGCACAGTGT ATGCACCTTC ATTTAAATAC ATCTGTGTGC 420 

ATACAGATAC GCATATATGT GTGTGCGTAT GCATATATCT CTCATCTGTA GTTTCCAAGA 480 

GTTCAGCTGA AGCAGATGGA GTCCTGCAGC CCAGGAGACA CCCTGCATCC CTGCTAATAG 540 

TGTTTGCCAC AAGTATTAGT GAGTCTTCCT TATTAATATT TTCATTTCAG AAGACTGAAG 600 


CAAAGCTGAT AGTGTTTGCT GTTTCTTTGG CAGCTAAGTG AGGGTCTTGG GATGACTTGC 660 

TGTGTTCCTC AAGCTGCACT TTGGGGCCAT CTCTGCAGTA TTAGCCCCCT TTTTGCTTGG 720 

TGGTACTCTG TCTGTGCCTG TGTGTGTGTG TGATAGTCAC TCTTGCATGG CTTCCATGTC 780 

TGGTTTGTGG CATTTGGGGA TAAQGTGCTG AAGGCAGAGC ATTTGGAGTT TGTTTGAGGG 840 

CTCGTTGCCA ATGATAGATC ACTCCTGTTG ACCTGGTATG TCTGCTTGCT TGCTGCTTTT 900 


CCTTGCTTTC TCTTGGAAGA GGAAAGGACT CTGGTCAGGC CCAGGCTGAG TGAGATGAQC 960 

TGCAGCTGGC TCATGGCCTT CTTAGAGCAG AGAGAGGAGT ATGTCATTTT ACTAAGTTCC 1020 

TAAACAAACA TTTATGCAGG CAACACTCCT TGCAGATCCA GAAACTGAGG CACAATAGGG 1080 

TTATGACTTG CTCAAGAATA TGTAGCTGCT AGGGGGTAAA TCAAGGCATC ACAATTTCTG 1140 

TTCAGCGGGC AGGAATAGGC TGTGAATTGC TAGCACTTTT TTTTTTTAAG CAATTACTTT 1200 


TTGACTTGTT CCTCTGAAAG TGCAAGAGGC GTACACCTTT CCCAAATGTA GACTAGAATC 1260 

TGCAGQATGC CACCCACTGT ATAGTTCTGC TTTCCCAGAG AGGAAGAACT TTTAGAAACC 1320 

AAATGATCTT AATTGTTATT GCCCACCCCT GGCTTTTCCG GGTAGAAAAT TCACAGTAGG 1380 

AATGATTGTT AAGAGAGAGT GCTTGGAACC ATGGGTTAAC AGGAAAGGCT ACCTAACTTC 1440 

ACATATCTGC AACCAGAGCA GCCACCAAGC ATTACTTAGC AGCAGGAAAA TGATTCTATT 1500 
TGAGTTCCTG TGTGTCCAAA ACTGAGGCAC CATGTTCTTT GAAAACATGC CACCTCAAGG 1560 
CTGGGCGCGG TGGCTCACAC CTGTAATCCC AGCAYTTTGG GGAGGCCSAG GGCGGGGCGG 1620 
KTTCACCGGG GGTC 1634 
(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
TTCCCCCCAG GTTGGCTTCC TTCGATTCCT TTTCTTGGTA TCAACGTTTG ATTGGAAGAA 60 

CAACCCCCTC TTTGTCAACC TCAATAATGA GCTCACTGTG GAGGAGCAGC TCGQGCACAG 120 

CTCMCCGTYA TCGTCATTGT TACCCCCCAA GACCGCAAAA ACTCTGTGTG GACACAGGAT 180 

GGACCCTCAG CCCAGATCCT GCAGCAGCTT GTGGTCCTGG CAGCTGAAGC CCTQCCCATG 240 

TTAGAGAAGC AGCTCATGGA TCCCCGGGGA CCTGGGGACA TCAGGACAGT GTTCCGGCCG 300 

CCCTTGGACA TTTACGACGT GCTGATTCGC CTGTYTCCTC GCCATATCCC GCGGCACCGC 360 

AGGCTTGTGG ACTCGCCAGY TGCCTCCTTC TGCCGGGGCC TGCTCAGCCA GCCGGGGCCC 420 

TCATCCCTGA TGCCCGTGCT GGGTNATGAT CCTNCTCAGC TCTATCTGAC GCAGCTCAGG 480 

GAGGCCTTTG GGGATCTGGC CCTTTTCTTC TATGACCAGC ATGGTGGAGA GGTGATTGGT 540 

GTCCTCTGGA AGCCCACCAG CTTCCAGCCG CAGCCCTTCA AGGCCTCCAG CACAAAGGGG 600 

CGCATGGTGA TGTCTCGAGG TGGGGAGCTA GTAATGC5TGC CCAATGTTGA AGCAATCCTG 660 

GAGGACTTTG CTGTGCTGGG TGAAGGCCTG GTGCAGACTG TGGAGGCCOG AAGTGAGAGG 720 

TGGACTGTGT GATCCCAGCT CTGGAGCAAG CTGTAGACGG ACAGCAGGAC ATTQGACCTC 780 

TAGAGCAAGA TGTCAGTAGG ATGACCTCCA CCCTCdTGG ACATGAATCC TCCATGGAGG 840 

GCCTGCTGGC TGAACATGCT GAATCATCTC CAACAAAACC CAGCCCCAAC TTTCTCTCTG 900 

ATGCTCCAGC ATTGGGGCAG GGGCATGGTG GCCCATGTAG TCTCCTGGGC CTCACCATCC 960 

CAGAAGAGGA GTGGGAGCCA GCTCAGAGAA GGAACTGAAC CCAGGAGATC CATCCACCTA 1020 

TTAGCCCTGG GCCTGGACCT CCCTGCGATT TCCCACTCCT TTCTTAGTCT TCTTCCAGAA 1080 

ACAGAGAAGG GGATGTGTGC CTGGGAGAGG CTCTGTCTCC TTCCTGCTGC CAGGACCTGT 1140 

GCCTAGACTT AGCATGCCCT TCACTGCAGT GTCAGGCCTT TAGATGGGAC CCAGCGAAAA 1200 

TGTCGCCCTT CTCAGTCACA TCACCGACAC TGAGCAGTGG AAAGGGGCTA TATGTGTATG 1260 
AATAGACCAC ATTGAAGGAG CACAATGCCC TCCTGTGTTG ATGCCACTTTC CCAGGGTQGA 1320 



GACAGTGGAA AAGAACCGAG GACAGGAAAG GATTGGGTAG GTGAAGGGGT CAGGGGACTG 1380 


GTAGTCACCC AATCTTGGAG AGGTGCAAAA AGCACTGGGG GCTACCCGTT AGCTGCATCT 1440 



GCCCTGGCTG TTTGCCCX3TT CATGTCACAA ACTGCCACTA CTATGTACCT GCAGTGGGGT 1500 



TGCAGAGATG GGGGAGACTC AAGTCTTACT CCCCAGGAGC TCCCAGGGCC CAAGGAGGAG 1560 



AATGCTGCCT CCTTTCAGTC TGGTCTACAC CXACTTTCTG GTAGCCTCTC TGCTTCCTGT 1620 



AATTCTGGCT GTTTTTCCAG ACTCAGCTCA AATAGTGCCC CTCCTTAAGC CX:ATCCCTCG 1680 


CCCCCAGCCT GAGGTGATCT TTCCCTCCTC TGAACTATTA GAGCAGTTAC TGTCTGTTCA 1740 
GTTCGTTTGG CAGGCACACA CAGTGGCATA AATTCTATTG TTTTGAACTC TGATT 1795 


(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 



TCTGCTTTCG GTGCTGCGTG TACTGCTGGG CGGCTTCTTC GCGCTCGTGG GGTTGGCCAA 60 



GCTCTCGGAG GAGATCTCGG CTCCAGTTTC GGAGCGGATG AATGCCCTGT TCGTGCAGTT 120 



TGGTGAGGTG TTGGGGGTGA AGGTATTTGG CTACCAGCCA GATCCCCTGA ACTACCAAAT 180 



AGCTGTGGGC TTTCTGGAAC TGCTGGCTGG GTTGCTGCTG GTCATGGGCC CACCGATGCT 240 


GCAAGAGATC AGTAACTTGT TCTTGATTCT GCTCATGATG GGGGCTATCT TCACCTTGGC 300 



AGCTCTGAAA GAGTCACTAA GCACCTGTAT CCCAGCCATT GTCTGCCTGG GGTTCCTGCT 360 



GCTGCTGAAT GTCGGCCAGC TCTTAGCCCA GACTAAGAAG GTGGTCAGAC CCACTAGGAA 420 



GAAGACTCTA AGTACATTCA AGGAATCCTG GAAGTAGAGC ATCTCTGTCT CTTTATGCCA 480 



TGCAGCTGTC ACAGCAGGAA CATGGTAGAA CACAGAGTCT ATCATCTTGT TACCAGTATA 540 


ATATCCAGGG TCAGCCAGTG TTGAAAGAGA CATTTTGTCT ACCTGGCACT GCTTTCTCTT 600 



TTTAGCTTTA CTACTCTTTT GTGAGGAGTA CATGTTATGC ATATTAACAT TCCTCATGTC 660 



ATATGAAAAT ACAAAATAAG CAGAAAAGAA ATTTAAATCA ACCAAAATTC TGATGCCCCA 720 



AATAACCACT TTTAATGCCT TGGTGTAAGT ATACCTCTGA ACTTTTTTCT GTGCCTTTAA 780 



ACAGATATAT ATTTrTTTTT AATGAAAATA AAACCATATA TCCTATTTTA TTTCCTCCTT 840 

TTAAAACCTT ATAAACTA 858 



(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 915 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
GAATTCGGCA CGAGCGCGGC CATGGCGCTC CTGCTTTCGG TGCTGCGTGT ACTGCTGGGC 60 

GGCTTCTTCG CGCTCGTOGG GTTGGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTCG 120 

GAGCGGATGA ATGCCCTGTT CGTGCAGTTT GCTGAGGTGT TCCCGCTGAA GGTATTTGGC 180 

TACCAGCCAG ATCCCCTGAA CTACCAAATA GCTGTGGGCT TTCTGGAACT GCTGGCTGGG 240 

TTGCTGCTGG TCATGGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTGATTCTG 300 


CTCATGATGG GGGCTATCTT CACCTTGGCA GCTCTGAAAG AGTCACTAAG CACCTGTATC 360 

CCAGCCATTG TCTGCCTGGG GTTCCTGCTG CTGCTGAATG TCGGCCAGCT CTTAGCCCAG 420 

ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTGG 480 

AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540 

ACAGAGTCTA TCATCTTGTT ACCAGTATAA TATCCAGGGT CAGCCAGTGT TGAAAGAGAC 600 


ATTTTCTCTA CCTGGCACTG CrTT^ TTAGCTTTAC TACnCTlTTC_ TGMGW3TAC 660 

ATGTTATGCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAGC AGAAAAGAAA 720 

TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780 

TACCTCTGAA CTTTTTTCTG TGCCTTTAAA CAGATATATA TTTITTTTWA ATGAAAATAA 840 

AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 900 
AAAAAAAAAA CTCGA 915 




(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1517 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

CCTTGTGGCA ACTAGTGGGT CCCCCGGGCT GCAGWAATTC GGGCAGTGGT TCTGNGTCTG 60 

AAGATACTCT GAGTTCCTCT GAGAGATCCA AAGGCTCCGG GAGCAGACCC CCAACCCCCA 120 

AAAGCAGCCC TCAGAAGACC AGGAAGAGCC CTCAGGTGAC CAGGGGTAGC CCTCAiGAAGA 180 

CCAGCTGTAG CCCTCAGAAG ACCAGGCAGA GCCCTCAGAC GCTGAAGCGG AGCCX3AGTGA 240 

CCACCTCACT TGAAGCTTTG CCCACAGGAC AGTGCTGACA GACAAGAGTG GGCGACAGTG 300 

GAAGCTGAAG TCCTTCCAGA CCAGGGACAA CCAGGGCATT CTCTATGAAG CTGCACCCAC 360 

CTCCACCCTC ACCTGTGACT CAGGACCACA GAAGCAAAAG TTCTCACTCA AACTGGATGC 420 

CAAGGATGGG CGCITGTTCA ATGAGCAGAA CTTCTTCCAG CXX3GCCGCCA AGCCTCTGCA 480 

AGTCAACAAG TGGAAGAAGC TGTACTCGAC CCCACTGCTG GCCATCCCTA CXTTGCATGCG 540 

TTTCGGTGTT CACCAGGACA AATACAGGTT CTTGGTGTTA CCCAGCCTGG GGAGGAGCCT 600 

TCAGTCGGCC CTGGATGTCA GCCCAAAGCA TGTGCTGTGC AGAGAGGTCT GTGCTGCAGG 660 

TGGCCTGCCG GCTGCTGGAT GCCCTGGAGT TCCTCCATGA GAATGAGTAT GTTCATGGAA 720 

ATGTGACAGC TGAAAATATC TTTGTGGATC CAGAGGACCA GAGTCAGGTG ACTTTGGCAG 780 

GCTATGGCTT CGCNTTCCGC TATTGOXAA GTGGCAAACA CGTGGCCTAC GTGGAAGGCA 840 

GCAGGAGCCY TCACGAGGGG GACCTTGAGT TTCATTAGCA TGGACCTGCA CAAGGGATGC 900 

GGGCCCTCCC GCCGCRGYGA CCTCXAGAGC CTGGGYTAMT GCATGCTGAA GTGGVTCTAM 960 

GGGTTTCTGC CATGGACAAA TTGCCTTCCA AMAMTGAGGA CATCATGAAG CAAAAACAGA 1020 

AGTTGCCTTG GGATTCATTT TAATGTAAGC TKGACnTGT CATGCCAGAA ACAAGGCTCG 1080 
GTCACCGTCA GCAGTTTCGA GTTTTCGAGC TGGWGGGAGTE TCGTCCGTGT GGTTSAGGCA 1140 

GATATCTCCG TTATGCAGCC GCCTCCGGGG GACCACCTCC CTCCCTTTGA GTCAGCCACA 1200 

GACAGCCTAC TTGACGGCCC CGCTQGCCCC CACATTCCAC TGAACTGTGC GGATGCCACA 1260 

GTGACCCCCT CTCAGGCACA GCATGACCTC CTGAAGTCGA GCCTGCTTGC TTTGAACXTTA 1320 

CCAGTTAAAA TCTCCTCAAA ATGTTTGGAT ACCGCCCATT GGCCCCTCAC AGCCACGAGC 1380 

TCCCTGACCA GTGTGCGTGT GTGTGTGTGT GTGTGTCTGT GTGTGTGCTT GGGACGGGTG 1440 

GGGAGGTCAC CTTTGGGTGT GCGGTGTGCC CCCAGGACCT GTAAGTAATA AAATCTTTAT 1500 

TTCCAAAAAA AAAAAAA 1517 




(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3865 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 
TGGAGGGGGG GAGCTTCCTT GAGCAGTGGG CCCAGGCCTG GCCCTCCACA CTTCATTCTC 60 
TGACCTTTCT CTCTCCTCAT TTCGGTGCAT GTCCTTTCTG CAGCTGCCTT TCAGCACAGG 120 
TQGTTCCACT GGGGGCAGCT AACGCTGAGT GACAAGGATG GGAAGCCACA GGTGCATTTT 180 
ACTCAAGTCT TCTCT/^GTCA ATGAGGGGCA CCCAGTGCTT CTAGGGCAGG CTGGGTGGTG 240 
GTCCCCTAGG TATCAGCCTC TCTTACTGTA CTCTCCQGGA ATGTTAACCT TTCTATTTTC 300 
AGCCTGTGCC ACCTGTCTAG GCAAGCTGGC TTCCCCATTG GCCCCTGTGG GTCCACAGCA 360 
GCGTGGCTSC CCCCCAGGGC CACCGCTTCT TTCTTGATCC TCTTTCCTTA ACAGTGACTT 420 
GGGCTTGAGT CTGGCAAGGA ACCTTGCTTT TAGCTTCACC ACCAAGGAGA GAGGTTGACA 480 
TGACCTCCCC GCCCCCTCAC CAAGGCTGGG AACAGAGGGG ATGTGGTGAG AGCCAGGTTC 540 
CTCTGGCCCT CTCCAGGGTG TTTTCCACTA GTCACTACTG TCTTCTCCTT GTAGCTAATC 600 
AATCAATATT CTTCCCTTGC CTGTGGGCAG TNGGAGAGTG CTGCTGGGTG TACGCTGCAC 660 
CTGCCCACTG AGTTGGGGAA AGAGGATAAT CAGTGAGCAC TGTTCTGCTC AGAGCTOCTG 720 
ATCTACCCCA CCCCCTAGGA TCCAGGACTG GGTCAAAGCT GCATGAAACC AGGCCCTGGC 780 
AGCAACCCTG GGAATGGCTG GAGGTGGGAG AGAACCTGAC TTCTCTTTCC CTCTCCCTCC 840 
TCCAACATTA CTGGAACTCT ATCCTGTTAG GATCTTCTGA GCTTGTTTCC CTGCTGGGTG 900 
GGACAGAGGA CAAAGGAGAA GGGAGGGTCT AGAAGAGGCA GCCCTTCriT GTCCTCTGGG 960 
GTAAATGAGC TTGACCTAGA GTAAATGGAG AGACCAAAAG CCTCTGATTT TTAATTTCCA 1020 
TAAAATGTTA GAAGTATATA TATACATATA TATATTTCTT TAAATTTTTG AGTCTTTGAT 1080 
ATGTCTAAAA ATCCATTCCC TCTGCCCTGA AGCCTGAGTG AGACACATGA AGAAAACTGT 1140 
GTTTCATTTA AAGATGTTAA TTAAATGATT GAAACTTGGC TGTGGCTACT GCTTCTTAAT 1200 
GTTGGGGGGA CAGGGCAGTG GTCTGGGCCC ACATTTAGAA QGGAAAATGT TTTGCCTGCT 1260 
GCACACATTG GACCCAAGTA TGGGCCTCTT CTGCCTAGTA CTGCCAAAGG GACTGTTAAG 1320 
GTGTCTTGTC CATCTTCTAC CCCCCACCCC CCATTACGGG TAAAGGRAAC CCCAGACTAG 1380 
GTGAGGQGCC AQCAGCTGCC TCACATTGTG TTCTCTCCTG AGATGGTCCA GCTCACATCC 1440 
AGACACCTTG TTCAGACATT TTATTTGAAT TTATGACAGT GATGGGGATT TGACTGAGAT 1500 
GCCTTATGGA GAAGTACCCC ACCCTCTATG AAGACAGAAT CACTCTCTGC CATTCATTCT 1560 
GCCTGATGCT AACAACACGC AGCTGATTTA GGGAGTGTCC CAGCCTAGCT GGATCAAGGG 1620 
AAATTCCAGG AGCCCTGGGG CAGGCCCTGG NCCCCAGTGC CAAGCCTCAG AGTAAGCAGA 1680 
CATTGGGAAA GTTGCCAACC ACTTGGTAGA CCACTAGGTT CTCTGTTTTC CCTTCCCTTT 1740 

CCTTTTCAAA TCCCACAGTT TCCTGTTGGG GAGAAGCTGT AATTAGCCTA GTCCAGGTAC 1800 

CAGATCCCAG CTAGGGGCX3C AGCTGNCTTG GATAACTCCA AGAAAACCTG GGCACCAGTA 1860 

TTTTTCCAAT TATAAGGACT GTGGCATAAA TTTTTAAATG AGTTATATTG AAACCAGATT 1920 

TCTCCAGCTG CCAAGGGAAG AAGGTAGGGC TGGACTCCCT GCTGTGGCCC AGCCCTTGTT 1980 

AGGGGTTGGT CTCTCACTGC AGCCAGACAG GATGATCCTG GGTTCTGGGG AGGGTAAGCT 2040 

GCCCCTTGCC GAGTTCTGCA CCGAATAAAG AGTCCAAACC CGCTGCTTCC GTGTCCTGAG 2100 

AGATGGGTAA ATGGGTGATG GATGGAGCAG ACTGAAGAGA CAGCAGATGA CTCAGTGGTG 2160 

GAAGAAGGGG GGAAGATGCT GGGCTGGCTA GCTAATGTTC CCCXXTTTTCA GCGATTTACA 2220 

GGAAATGGAG CCCAGCTTGG TCATGAAGTT GGTTTGCTTC CACTGTGCGA TGCACTCCTC 2280 

AGAAATOTTG AAGTCAGCCT GCAACTTCTC GAAGACTTTC 1TCTTGGGCT TGAGCTCCTC 2340 

ATCTGGTTGG CCCTTTTCAT AGCCCTTCAC AAACACGTGC TCACCAGGAG CAGAGCCTGC 2400 

CGGAGGGTCC AGAGGTTCAA CTGGCGGTTT ATCCCTTCTA TAGAAGCACA CAGAAGCATG 2460 

CCTTGGGACT CGACTCCTCT CATCTTCTGG GGTTTCAGGT TGCACAGCAC CACTACCAGC 2520 

CTGTCCTGCA GTTCCTCCTT GGGCACGAAC TGTACCAGGC CGCTCACCAC AGTCCX3TGGT 2580 

TCAGCTTCCC CCACGTCAAT CTTCTCTACA TACAGGCTGT CTGCATCTGG GTGCTTCTCC 2640 

ACAGTGATGA TTTTCCCCAC ACGGATATCC AGCCX3GGATG GGATGACCTC CrcrGGTTCT 2700 

GAATTCTTGG CAGGCCTTTG GCCATTGGCT TCTGCTTTGA GGGATCTGGG TAGGCAGCGC 2760 

TGGCCAGTTT TTTCAGGGCA GGGGTATTAA ACTTTTCCCG GATTGGATCC AGCAACTTGT 2820 

TCAGTGCGAC TTCAACAGAA TTCTTCAGGT CTCCAGGATG TACAACCTCA GCAGCAAAGT 2880 

CCTTTTCCAG GTCCACGTAA GCTGTGTAGG TTTTGTTTCC ACCCCATTTC TCATCTCGTA 2940 

GGATCACAAA CTCGGACTTA AGGGGAAAAA GGACATGCTT GATGAAGGAC AGAACCCCAT 3000 

TGTTCTCCAC ATTTCCTGGC TCACAGAAGG CCTTCTTCAG TTTTTTCTTC ACATCCTCCT 3060 

TCCGATCAAG GAGATCAATC TTGGACTCCT CTTCTGAAGA GCTCATTTTG CTGCCTGTTA 3120 

ATCCTGGAAC CATAGGATTC ATCAGATGGA CCCGTTTTGA ATAGCCAAGT GCAGGGAGGT 3180 

ACrrCTCTGC AAAGGTGAAA ATCTTTCTCT GATCAATGCC TCCAAATTGG GCATCTACTT 3240 

TTAAATACTC TTCATCCAAA GCCTGCAGTC CGGGGTATAA GAGGCCACTC AGCAAAGGGT 3300 

GCTCCACCTG CTTTACCACC TCAGCTCCAG CCTTCTTGGA ATCGTGCTGT GTGACCACGG 3360 

AGGAGAGTCT GTACACATCT AGTGTGTACT CTTTGCTGAG CTGGTAATCA GTGCCTTTGA 3420 

TGAACTTGAG CTTCTCCAAG GGCACACXIIAA TGCTCTCCAG CATTGCTTTG ATCACATTCT 3480 
CATAGTAACT GACTCGGAGT TCTAGAAGTT CCCATGGGGC TTTCATGTTA TCCAGGTATG 3540 

CGTGGAGGTC CGCAAACAGA ATTGTTACCT CACACCCTGC CTTTAAGAAG TCTGCAATCT 3600 

TIGACPiTCGG CACAAAGTAA GCCACATGTG GTTTGCCCGT GGTTGCCGTT CCCCAGTAAA 3660 

TTTTAAGTTC CCGCTCCTTC AGTATCTCCT TCAGCTTCTC TTCCCCCAGA ACCTCCTGCA 3720 

GGTTCCGGGT GATAAGGTGC AGTTTCTCTT CAGGGCTGGG AGCGTCCCCC ATGGTCCXXrT 3780 

ACCCXTTGCTT CCCCCGCTCA GCCCGGCACC AGAGCCCCTT CCTGGGTCAC CGTCGCCGCC 3840 

GCXSTGCCXX^G AACTGTCACG CX3AGT 3865 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



AGGGAGAGGA GGAGAGGGGG TCTGCGCGCG GCCGCTACCC AGAAGCCAGC GGACGGCAGC 60 

ACGGAGTGGG CTGTCCCCGA GCCCAGCCCC GAGCGAGCCC CCCCCCCGCC CCCGMAGGAC 120 

GCGCCTYCCA GCCAGCCCGA CTCCTAGGAG GAGGGGAGGC QGGAAAGCAG CTCAAGCCTC 180 

ACCCACCGCC CTGCCCCCAG CCCCGCCACT CCCAGGCTCC TCGGGACTCG GCGGGTCCTC 240 

CTGGGAGTCT CGGAGGGGAC CGNCTGTGCA GACGCCATGG AGTTGGTGCT GGTCTTCCTC 300 

TGCAGCCTGC TGGCCCCCAT GGTCCTGGCC AGTGCAGCTG AAAAGGAGAA GGAAATGGAC 360 

CCTTTTCATT ATGATTACCA GACCCTGAGG ATTGGGQGAC TGGTGTTCGC TGTGGTCCTC 420 

TTCTCGGTTG GGATCCTCCT TATCCTAAGT CGCAGGTGCA AGTGCAGTTT CAATCAGAAG 480 

CCCCQGGCCC CAGGAGATGA GGAAGCCCAG GTGGAGAACC TCATCACCGC CAATGCAACA 540 

GAGCCCCAGA AAGCAGAGAA CTGAAGTGCA GCCATCAGGT GGAAGCCTCT GGAACCTGAG 600 

GCGGCTGCTT GAACCTTTGG ATGCAAATGT OGATGCTTAA GAAAACCGGC CACTTCAGCA 660 

ACAGCCCTTT CCCCAGGAGA AGCCAAGAAC TTGTGTGTCC CCCACCCTAT CCCCTCTAAC 720 

ACCATTCCTC CACCTGATGA TGCAACTAAC ACTTGCCTCC CCACTGCAGC CTGCGGTCCT 780 

GCCCACCTCC CGTGATGTGT GTGTGTGTGT GTGTGTGTGT GACTGTGTGT GTTTGCTAAC 840 

TGTGGTCTTT GTGGCTACTT GTTTGTGGAT GGTATTGTGT TTGTTAGTGA ACTGTGGACT 900 

CGCTTTCCCA GGCAGGGGCT GAGCCACATG GCCATCTGCT CCTCCCTGCC CCCGTGGCCC 960 
TCCATCACCT TCTGCTCCTA GGAGGCTGCT TGTTGCCCGA GACCAGCCCX: CTCCCCTGAT 1020 

TTAGGGATGC GTAGGGTAAG AGCACGGGCA GTGC5TCTrCA GTCGTCTTGG GACCTGGGAA 1080 

GGTTTGCAGC ACTTTGTCAT CATTCTTCAT GGACTCCTTT CACTCCTTTA ACAAAAACCT 1140 

TGCTTCCTTA TCCCACCTGA TCCCAGTCTG AAGGTCTCTT AGCAACTGGA GATACAAAGC 1200 

AAGGAGCTGG TGAGCCCAGC GTTGACGTCA GGCAGGCTAT GCCCTTCCGT GGTTAATTTC 1260 


TTCCCAGGGG CTTCCACX5AG GAGTCCCCAT CTGCCCCGCC CCTTCACAGA GCGCCCQGGG 1320 

ATTCCAGGCC CAGGGCTTCT ACTCTGCCCC TGGGGAATGT GTCCCCTGCA TATCTTCTCA 1380 

GCAATAACTC CATGGGCTCT GGG/^CCTAC CCCTTCCAAC CTTCCCTGCT TCTGAGACTT 1440 

CAATCTACAG CCCAGCTCAT CCAGATGCAG ACTACAGTCC CTGCAATTGG GTCTCTGGCA 1500 

GGCAATAGTT GAAGGACTCC TGTTCCGTTG GGGCCAGCAC ACCGGGATGG ATGGAGGGAG 1560 


AGCAGAGGCC TTTGCTTCTC TGCCTACX3TC CCCTTAGATG GGCAGCAGAG GCAACTCCCG 1620 

CATCCTTTGC TCTGCCTGTC GGTGGTCAGA GCGGTGAGCG AGGTGGGTTG GAGACTCAGC 1680 

AGGCTCCGTG CAGCTCTTGG GAACAGTCAG AGGTTGAAGG TCATAACGAG AGTGGGAACT 1740 

CAACCCAGAT CCCGCCCCTC CTGTCCTCTG TGTTCCCGCG GAAACCAACC AAACCGTCCG 1800 

CTGTGACCCA TTGCTGTTCT CTGTATCGTG ATCTATCCTC AACAACAACA GAAAAAAGGA 1860 


ATAAAATATC CTTTGTTTCM TAAAAAAAAA AAAAAAAAAA AGGGGGGGGG 1910 




(2) INFORMATION FOR SEQ ID NO: 291: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3276 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 


GCGACCGTCG TTTGAGTCGT CGCTGCCGCT GCCGCTGCCA CTGCCACTGC CACCTCGCGG 60 

ATCAGGAGCC AGCGTTGTTC GCCCGACGCC TCGCTGCCGG TGGGAGGAAG CGAGAGQGAA 120 

GCCGCTTGCG GGTTTGTCGC CGCTGCTCGC CCACCGCCTG GAAGAGCCGA GCCCCGGCCC 180 

AGTCGGTCGC TTGCCACCGC TCGTAGCCGT TACCCGCGGG CCGCCACAGC CGCCGGCCGG 240 

GAGAGGCGCG CGCCATQGCT TCTGGAGCCG ATTCAAAAGG TGATGACCTA TCAACAGCCA 300 


TTCTCAAACA GAAGAACCGT CCCAATCGGT TAATTGTTGA TGAAGCCATC AATGAGGACA 360 

ACAGTGTGGT GTCCTTGTCC CAGCCCAAGA TGGATGAATT GCAGTTGTTC CGAGGTGACA 420 

CAGTGTTGCT GAAAGGAAAG AAGAGACGAG AAGCTGTTTG CATCGTCCTT TCTGATGATA 480 



WO 98/39448 PCT/US98/04493 






CrrcTTCTGA TGAGAAGATT CGGATGAATA GAGTTGTTCG GAATAACCTT CGTGTACGCX: 540 

TAGGGGATCT CATCAGCATC CAGCCATGCC CTGATGTGAA GTACGGCAAA CGTATCCATG 600 

TGCTCCCCAT TGATGACACA GTGGAAGGCA TTACTGGTAA TCTCrTCGAG GTATACCTTA 660 

AGCCC3TACTT CCTGGAAGCG TATCGACCCA TCCGGAAAGG AGACATTTTT CTTGTCCGTG 720 

GTGGGATGCG TCCTGTGGAG TTCAAAGTGG TGGAAACAGA TCCTAGCX:CT TATTGCATTG 780 

TTGCTCCAGA CACAGTGATC CACTGCGAAG GGGAGCCTAT CAAACGAGAG GATGAGGAAG 840 

AGTCCTTCAA TGAAGTAGGG TATGATGACA TTGGTGGCTG CAGGAAGCAG CTAGCTCAGA 900 

TAAAGGAGAT GGTGGAACTG CCCCTGAGAC ATCCTGCCCT CTTTAAGGCA ATTGGTGTGA 960 

AGCCTCCTAG AGGAATCCTG CTTTACGGAC CTCCTGGAAC AGGAAAGACC CTGATTGCTC 1020 

GAGCTOTAGC AAATCAGACT GGAGCCTTCT TCTTCTTGAT CAATGGTCCT GAGATCATGA 1080 

GCAAATTGGC TGGTCAGTCT GAGAGCAACC TTCGTAAAGC CTTTGAGGAG GCTGAGAAGA 1140 

ATCCTCCTGC CATCATCTTC ATTGATGAGC TAGATGCCAT CGCTCCCAAA AGAGAGAAAA 1200 

CTCA1GGCGA GGTOGAGCGG CGCATTGTAT CACAGTTGTT GACCCTCATG GATGGCCTAA 1260 

AGCAGAGGGC ACATCTGATT GTTATGGCAG CAACCAACAG ACCCAACAGC ATTGACCCAG 1320 

CTCTACGGCG ATTTGGTCGC TTTGACAGGG AGGTAGATAT TGGAATTCCT GATGCTACAG 1380 

GACGCTTAGA GATTCTTCAG ATCCATACCA AGAACATGAA GCTGGCAGAT GATGTGGACC 1440 

TCGAACAGTA GCCAATGAGA CTCACGGGCA TGTGGGTGCT GACTTAGCAG CCCTGTGCTC 1500 

AGAGGCTCCT CTCCAAGCCA TCCGCAAGAA GATCGATCTC AITCACCTAG _AG^^ 1560 

CATTGATCCC GAGGTCATGA ACTCTCTAGC AGTTACTATG GATGACTTCC GGTGGGCXrTT 1620 

GAGCCAGAGT AACCCATCAG CACTGCGGGA AACCGTGGTA GAGGTGCCAC AGGTAACCTG 1680 

GGAAGACATC GGGGGCCTAG AGGATGTCAA ACGTGAGCTA CAGGAGCTGG TCCAGTATCX: 1740 

TOTGGAGCAC CCAGACAAAT TCCTGAAGTT TGGCATGACA CCTTCCAAGG GAGTTCTGTT 1800 

CTATGGACCT CCTGGCTGTG GGAAAACTTT GTTGGCCAAA GCCATTGCTA ATGAATGCCA 1860 

GGCCAACTTC ATCTCCATCA AGGGTCCTGA GCTGCTCACC ATGTGGTTTG GGGAGTCTGA 1920 

GGCCAATGTC AGAGAAATCT TTGACAAGGC CCGCCAAGCT GCCCCCTGTG TGCTATTCIT 1980 

TGATGAGCTG GATTCGATTG CCAAGGCTCG TGGAGGTAAC ATTGGAGATG GTGGTGGGGC 2040 

TOCTGACCGA GTCATCAACX: AGATCCTGAC AGAAATGGAT GGCATGTCCA CAAAAAAAAA 2100 

TGTGTTCATC ATTGGCGCTA CCAACCGGCC TGACATCATT GATCCTGCCA TCCTCAGACC 2160 

TGGCCGTCTT GATCAGCTCA TCTACATCCC ACTTCCTGAT GAGAAGTCCC GTGTTGCCAT 2220 

CCTCAAGGCT AACCTGCGCA AGTCCCCAGT TGCXTAAGGAT GTGGACTTGG AGTTCCTGGC 2280 
TAAAATGACT AATGGCTTCT CTGGAGCTGA CCTGACAGAG ATTTGCCAGC GTGCTTGCAA 2340 

GCTGGCCATC CGTGAATCCA TCGAGAGTGA GATTAGGCGA GAACGAGAGA GGCAGACAAA 2400 


CCCATCAGCC ATGGAGGTTAG AAGAGGATGA TCCAGTGCCT GAGATCCGTC GAGATCACTT 2460 

TGAAGAAGCC ATGCGCTTTG CX3CGCCGTTC TGTCAGTGAC AATGACATTC GGAAGTATGA 2520 

GATGITTGCC CAGACCCTTC AGCAGAGTCG GGGCTTTGGC AGCTTCAGAT TCCCTTCAGG 2580 

GAACCAGGGT GGAGCTGGCC CCAGTCAGGG CAGTGGAGGC GGCACAGGTG GCAGTGTATA 2640 

CACAGAAGAC AATGATGATG ACXTTGTATGG CTAAGTGGTG GTGGCCAGCG TGCAGTGAGC 2700 


TGGCCTGCCT GGACCTTGTT CCCTGGGGGT GGGGGCGCTT GCCCAGGAGA GGGACCAGGG 2760 

GTGCGCCCAC AGCCTGCTCC ATTCTCCAGT CTGAACAGTT CAGCTACAGT CTGACTCTGG 2820 

ACAGGGGGTT TCTGTTGCAA AAATACAAAA CAAAAGCGAT AAAATAAAAG CGAITTTCAT 2880 

TTGGTAGGCG GAGAGTGAAT TACCAACAGG GAA1TGGGCC TTGGGCTATG CCATTTCTGT 2940 

TGTAGTTTGG GGCAGTGCAG GGGACCTGTG TGGGGTGTGA ACCAAGGCAC TACTGCCACC 3000 


TGCCACAGTA AAGCATCTGC ACTTGACTCA ATGCTGCCCG AGCCCTCCCT TCCCCCTATC 3060 

CAACCTGGGT AGGTGGGTAG GGGCCACAGT TGCTGGATGT TTATATAGAG AGTAGGTTGA 3120 

TTTA1TTTAC ATGCTTTTGA GTTAATGTTG GAAAACTAAT CACAAGCAGT TTCTAAACCA 3180 

AAAAATGACA TGTTGTAAAA GGACAATAAA CGTTGGGTCN AAATGGGWRA AAAAAAAAAA 3240 

AAAAAAGGGG GGCCCCTCTA AAaSWCCANN CTTCGT 3276 




(2) INFORMATION FOR SEQ ID NO: 292: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

TTGCAATGGT TGAATTCCCC TCCTCACGCC AGCCTAGGAG AAGAAGTTCG TAGTCCCAGA 60 

GGTGAGGCAG GAGGCGGCAG TTTCTGGCGG GTGAGGGCGG AGCTGAAGTG ACAGCGGAGG 120 

CGGAAGCAAC GGTCGGTGGG GCGGAQAAGG GGGCTGGCCC CAGGAGGAGG AGGAAACCCT 180 

TCCGAGAAAA CAGCAACAAG CTGAGCTGCT GTGACAGAGG GGAACAAGAT GGCGGCGCCG 240 

AAGGGAGCCT CTGQGTGAGG ACCCAACTGG GGCTCCCGCC GCTGCTQCTG CTGACCATQG 300 

CCTTGGCCGG AGGTTCGGGG ACCGCTTCGG CTGAAGCATT TGACTCGGTC TTGGGTGATA 360 

CGGCGTCTTG CCACCGGGCC TGTCAGTTGA CCTACCCCTT GCACACCTAC CCTAAGGAAG 420 

AGGAGTTGTA CGCATGTCAG AGAGGTTGCA GGCTGITTTC AATTTGTCAG TTTGTGGATG 480 

ATGGAATTGA CTTAAATCGA ACTAAATTGG AATGTGAATC TGCATGTACA GAAGCATATT 540 

CCCAATCTGA TGAGCAATAT GCTTGCCATC TTGGTTGCCA GAATCAGCTG CCATTCGCTG 600 

AACTGAGACA AGAACAACTT ATGTCCCTGA TGCCAAAAAT GCACCTACTC TTTCCTCTAA 660 

CTCTGGTGAG GTCATTCTGG AGTGACATGA TGGACTCCGC ACAGAGCTTC ATAACCTCTT 720 

CATGGACTTT TTATCTTCAA GCCGATGACG GAAAAATAGT TATATTCCAG TCTAAGCCAG 780 

AAATCCAGTA CGCACCACAT TTGGAGCAGG AGCCTACAAA TTTGAGAGAA TCATCTCTAA 840 

GCAAAATGTC CTATCTGCAA ATGAGAAATT CACAAGCGCA CAGGAATTTT CTTGAAGATG 900 

GAGAAAGTGA TGGCTTTTTA AGATGCCTCT CTCTTAACTC TGGGTGGATT TTAACTACAA 960 

CriCTTGTCCT CTCGGTGATG GTATTGCTTT GGATTTGTTG TGCAACTGTT GCTACAGCTG 1020 

TGGAGCAC3TA TGTTCCCTCT GAGAAGCTGA GTATCTATGG TGACTTGGAG TTTATGAATG 1080 

AACAAAAGCT AAACAGATAT CCAGCTTCTT CTCTTGTGGT TGTTAGATCT AAAACTGAAG 1140 

ATCATGAAGA AGCAGGGCCT CTACCTACAA AAGTGAATCT TGCTCATTCT GAAATTTAAG 1200 

CATTTTTCTT TTAAAAGACA AGTGTAATAG ACATCTAAAA TTCCACTCCT CATAGAGCTT 1260 

TTAAAATGGT TTCATTQGAT ATAGGCCTTA AGAAATCACT ATAAAATGCA AATAAAGTTA 1320 

CTCAAATCTG TGAT^GACTGT ATTTGCTATA ACTTTATTGG TATTGTTTTT GTAGTAATTT 1380 

AAGAGGTGGA TGTTTGGGAT TGTATTATTA TTTTACTAAT ATCTGTAGCT ATTTTGTTTT 1440 

TTGCTTTGGT TATTGTTTTT TTCCCTTTTC TTAGCTATGA GCTGATCATT GCTCCTTCTC 1500 

ACCTCCTGCC ATGATACTGT GAGTTACCTT AGTTAACAAG CTGAATATTT AGTAGAAATG 1560 

ATGCTTCTGC TCAQGAATGG CCCACAAATC TGTAATTTGA AATTTAGCAG GAAATGACCT 1620 

TTAATGACAC TACATTTTCA GGAACTGAAA TCATTAAAAT TTTATTTGAA TAATTAAAAA 1680 

AAAAAAAAAA AANCT 1695 



(2) INFORMATION FOR SEQ ID NO: 293:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1501 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

CACTTTCAGC AGTCCTTTGC TCTCTTTGCT TCTACCTCAA ATAGCCCCAG GAGTGGGCTT 60

TAGTCTCCAA TATGGAGCAT CTCAAGCTTC TCCTGGQGGA TGGGGATTGG GATGGGCAGA 120

ATCTGTTTTG GWTCTCCXSGG TTATTTCCAG TGGGTGTAAA AGCAGAGCTG GGCCTTTCCC 180 

TCTCTTATCC CTGAGGGTGG GTAAGAAGGA CTGTATCTAC ACCTGTTCTT CCCTACCTTC 240 

TCITTTGTTA GGGAGGCCTC ATTCTAAGTT CCTCAAGAGA GTCCTTGGCT TAAAGCTGTA 300 

GCAAGGGTGT GCTAGGTGGG GGATTTGGAG CAAAACCX5TC GAGTAGGCAT GATACTGGTA 360

TGGAGTQGGC CTGCAAAATC AGACAGAAAT GGCTTGAGAA GCCGCAGQGG AGCATGCCTG 420 

TCTCTCAGTG ATAGAGTATG GGAGGGACCT CCCTAGCTTG GAAAATGAGA ATTGAAGQGG 480 

TTATGAACAA ATAGGATGCC TAGTTGAGGA TGTTCCCAAA GTTTTGTCCA ATCTTATCAT 540 

TAGTAGATTT TATAAGCCAC AGAGACAAAC CAGAAACGGA ATAATGTTAC TTTGGATCCT 600 

TTATTTTTTT GTTCTAGGTG TGGCTTTGTA CATGCAGAAG AATGCTATAT GCTGCACATT 660

TTGCCTTTAA AGTCTTACGA CTTTCCCCAT TTTAGTCTAA TGGGAAGATA CAGATGTGCA 720 

AGTCTGCTTT TTrGTTTTTT GTTATTATTT TTTTTTTTTT GCTCTGTGTT ATGGACATTT 780 

TCAGACATGC ACAGAAGTGG AGAGGATGGT CCTTGGACCC MATGTGTCCA TCACCTAGCT 840 

GCATCACTTA TCAGCTATGG TCAACCTGGT TTCATCTGTA TCTCTCTCTT TTCACCTGTA 900 

TTGTTTATTG AAAATCCAAG ACACTATGCC AATGCAACCG TGACTACTTT GGGAGATTGG 960

TAGTCTCTTT TGATGGTGAT AGTGATGGGG TGCACTATCA TT^TCACATC AGGTCTGCTT 1020 

TTTGCTTTTA ATGTTAACTA ATGAAGTTCC AGAGATGGGC CTTAGAAATG TGTTTTAAGA 1080 

ATTAACAAGG AGTCTCAAAA AGAAATGAGA GGGATGCTTC CTTTNCCCTT GCATCTACAA 1140 

AACMAGAGAG AGACTGTTCT GTTGTAAAAC TCTTTCAAAA ATTCTGATAT GGTAAGGTAC 1200 

TTGAGACCCT TCACCAGAAT GTCAATCTTT TTTTCTGTGT AACATGGAAA CTTGTGTGAC 1260

CATTAGCATT GTTATCAGCT TGTACTGGTC TCATAACTCT GGTTTTGGAA GAATAATTTG 1320 

GAAATTGTTG CTGTGTTCTG TGAAAATAAC CTCCCCAAAA TAATTAGTAA CTGGTTGTTC 1380 

TACTTGGTAA TTTGACACCC TGTTAATAAC GCAATTATTT CTGTGTTCTT AAACAGTATA 1440 

AATAGTTGTA AGTTTGCATG CATGATGGAA AAATAAAAAC CTGTATCTCT GTTAAAAAAA 1500 

A 1501



(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2683 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

TGANTGTGGT CCCGGGTGCN GATTGGCAGN GCCTCCGCCG CGGCTCGTGG TTGTCCCGCC 60 

ATGGCACTGT CGCGGGGGCT GCCCCGGGAG CTGGCTGAGG CGGTGGCCGG GGGCCGGGTR 120 

CTGGTGGTGG GGGCGGGCGG CATCGGCTGC GAGCTCCTCA AGAATCTCGT GCTCACCGGT 180 

TTCTCCCACA TCGACCTCAT TGATCTGGAT ACTATTGATG TAAGCAACCT CAACAGACAG 240 

TTTTTGTTTC AAAAGAAACA TGTTGGAAGA TCAAAGGCAC AGGTTGCCAA GGAAAGTGTA 300 

CTGCAGTTTT ACCCGAAAGC TAATATCGTT GCCTACCATG ACAGCATCAT GAACCCTGAC 360 

TATAATGTGG AATTTTTCCG ACAGTTTATA CTGGTTATGA ATGCTTTAGA TAACAGAGCT 420 

GCCCGAAACC ATGTTAATAG AATGTGCCTG GCAGCTGATG TTCCTCTTAT TGAAAGTGGA 480 

ACAGCTGGGT ATCTTGGACA AGTAACTACT ATCAAAAAGG GTGTGACCGA GTGTTATGAG 540 

TGTCATCCTA AGCCGACCCA GAGAACCTTT CCTGGCTGTA CAATTCGTAA CACACCTTCA 600 

GAACCTATAC ATTGCATCGT TTGGGCAAAG TACTTGTTCA ACCAGTTGTT TGGGGAAGAA 660 

GATGCTGATC AAGAAGTATC TCCTGACAGA GCTGACCCTG AAGCTGCCTG GGAACCAACG 720 

GAAGCCGAAG CCAGAGCTAG AGCATCTAAT GAAGATGGTG ACATTAAACG TATTTCTACT 780 

AAGGAATGGG CTAAATCAAC TQGATATGAT CCAGTTNAAA CTTTTTACCA AGCTTTTTAA 840 

AGATGACATC AGGTATCTGT TGACAATGGA CAAACTATGG CGGAAAAGGA AACCTCCAKT 900 

TCCGTTGGAC TGGGCTGAAG TACAAAGTCA AGGAGAAGAA ACGAATGCAT CAGATCAACA 960 

GAATGAACCC CAGTTA3GCC "tGAAMACCA GCAGGTTCTA GATGTAAAGA GCTATGCACG 1020 

TCTTTTTTCA TVAGAGCATCG AGACTTTGAG AGTTCATTTA GCAGAAAAGG GGGATGQAGC 1080 

TGAGCTCATA TGGGATAAGG ATGACCCATC TGCAATQGAT TTTGTCACCT CTGCTGCAAA 1140 

CCTCAGGATG CATATTTTCA GTATGAATAT GAAGAGTAGA TTTGATATCA AATCAATGGC 1200 

AQGGAACATT ATTCCTGCTA TTGCTACTAC TAATGCAGTA ATTGCTQGGT TGATAGTATT 1260 

GGAAGGATTG AAGATTTTAT CAGGAAAAAT AGACCAGTGC AGAACAATTT TTTTGAATAA 1320 

ACAACCAAAC CCAAGAAAGA AGCTTCTTGT GCCTTGTGCA CTGGATCCTC CCAACCCCAA 1380 

TTGTTATGTA TGTGCCAGCA AGCCAGAGGT GACTGTGCGG CTGAATGTCC ATAAAGTGAC 1440 

TGTTCTCACC TTACAAGACA AGATAGTGAA AGAAAAATTT GCTATGGTAG CACCAGATGT 1500 

CCAAATTGAA GATGGGAAAG GAACAATCCT AATATCTTCC GA^GAGGGAG AGACGGAAGC 1560 

TAATAATCAC AAGAAGTTGT CAGAATTTGG AATTAGAAAT GGCAGCCGGC TTCAAGCAGA 1620 

TGACTTCCTC CAGGACTATA CTTTATTGAT CAACATCCTT CATAGTGAAG ACCTAGGAAA 1680 
GGACGTTGAA
AGATGCTGCC 
CAGCTCAAGA 
ATCCCX5ACGT 
CAGTGCAAAG 
TTGAACAGAA 
GATTGTTATG 
TGCTCCCTTT 
TGAAAGTAGG 
AGATAAAACA 
AGTTTCAAAA 
TTTTGGCTAG 
ATTGAAACAG 
GATGGCACTA 
CGCCTGGTAT 
ATATTCGATG 
TGCATTCTKG 
TTrGAAGTTG
AAAAGCATAA 
GCAAGATGAC 
CATGAAGAAG 
AGGTCACGTA 
ATGCCTCTAA 
TCCTTTGTTC 
GAAAGCATTC 
AATATAGTTT 
ACACAATGCA 

AAGAAGAATT 
ATTCAAAGAA 
TGTATATTAA 
GTCTGTGTAA 
TTTTGTAAAA 
TKAAAAAAAN 



TTGGTGATGC 
CCAATGGGCA 
GTTCTCATAG 
AGAGAAGCCG 
TAGAACAGAA 
ACAGAACCCT 
CAAAGGGAAA 
ATTTTGCTAG 
TAAAAACCCT 
TGTTGCCTTT 
TAGTAAAGTT 
ATTTTTAGCC 
GTATCGAGTG 
TGTAAAACAA 
GAAGCCAATT 
CTCAAATAAC 
NACAGAAAAA 



CXrCGGAAAAA GTGGGGSCCA 
GTGATGATGG AGCTCAGCCC 
TTGATTCGGA TGAAGAAGAT 
CAAGAGGAAA TTAGATGAGA 
GGAAGAGCTT GATGATGTCA 
CTTACTATTT AGTTTATCTG 
AAATTGACAG CAGTGACTTG 
AACTGTTAGA CACATTGCAG 
TTGAACAAAG TGTGTCCATA 
TTAATGTAAA TACCCTTAGG 
GATACCTGGT TATAAATATT 
TAGATCTAAC CATTTTCATA 
CTATGCATTG AAACTTGTTT 
TGTTAATTTA CTCAAGITTT 
TTTGTGTATT GTTACAGTTT 
GACTATACTT ATGGACCAAA 
AAAAAAAACA AGA 



AACAAGCTGA 
TCCACCTCCA 
TCTTCAAATA 
AAGAGAATCT 
TAGCATTAGA 
GGCAGAACCA 
AAAATGATTC 
TATGCTGTAT 
ACCAGTCATG 
TATCATTAAT 
ATGCCTTTAT 
CTCTTAACTG 
TTAAATGTTA 
CAGTTTGTAC 
CAGGTTATTT 
TAAATGGCAY 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2683 
(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

GGACTCGGGG TGGCTCTAAG GGGCAGGGAT AGGGCTGGGG AGCGCCGGCC TGTGGCCCTG 60 

ACCAGCCCCT TCTCGTGCAG GTTCCACCCC GATGCAGGTG GTCACGTGCT TGACGCGGGA 120 

CAGCTACCTG ACGCACTGCT TCCTCCAGCA CCTCATGGTC GTGCTGTCCT CTCTGGAACG 180 

CACGCCCTCG CCGGAGCCTG TTGACAAGGA CTTCTACTCC GAGTTTGGGA ACAAGACCAC 240 

AGGGAAGATG GAGAACTACG AGCTGATCCA CTCTAGTCGC GTCAAGTTTA CCTACCCCAG 300 

TGAGGAGGAG ATTGGGGACC TGACGTTCAC TOTGGCCCAA AAGATGGCTG AGCCAGAGAA 360 

GGCCCCAGCC CTCAGCATCC TGCTGTACGT GCAGGCCTTC CAGGTGGGCA TGCCACCCCC 420 
TGGGTGCTGC AGGGGCCCCC TGCGCCCCAA GACACTCCTG CTCACCAGCT CCGAGATCTT 480

CCTCCTGGAT GAGGACTGTG TCCACTACCC ACTGCCCGAG OTTGCCAAAG AGCCGCCXXIA 540 

GAGAGACAGG TACCGGCTGG ACGATGGCCG CCGCGTCCGG GACCTGGACC GAGTGCTCAT 600 

GGGCTACCAG ACCTACCXXK: AGCCCTCACC CTCGTYTTCG ATGACGTGCA AGGTCATGAC 660 

CTCATGGGCA GTCTCACCCT GGACCACTTT GGGGAGGTGC CAGCTGGCCC GGCTAGAGCC 720 

AGCCAGGGCC GTGAAGTCCA GTGGCAGGTG TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG 780 

CTCATCTCGC TGTTGGCTCG CCAGTGGGAG GCCCTGTGTG GCCTGAGCTG CCTGTCGAGC 840 

tcaccggcta gcccaggcca cagccagcct gtcgtgtcca gcctgacgcx: TACTQGGGCA 900 

GGGCAGCAGG CrTTTGTGTT CTCTAAAAAT GTTTTATCCT CXrCTTTOGTA CXnTAATTTG 960 

ACTGTCCTCG CAGAAATGTG AACATGTGTG TGTGTTGTGT TAATTCTTTC TCATGTTGGG 1020 

ACyrGAGAATG CCGGGCCCCT CAGGGCTGTT CXSGTGTGCTG TCAGCCTCCX: ACAGGTGGTA 1080 

CAGCCGTGCA CACCAGTGTC GTGTCTGCTG TTGTGGGACC GTTGTTAACA CX5TGACACTG 1140 

TGGGTCTGAC TTTYTCTTCT ACACGTCCTT TCCTGAAGTG TCGAGTCCAG TCCTTTGTTG 1200 

CTCTTCCTGT TOCTGTTGCT GTTGCTGTTG GCATCTTGCT GCTAATCCTG AGGCTGGTAG 1260 

CAGAATGCAC ATTGGAAGCT CCCACCCCAT ATTGTTCTTC AAAGTGGAGG TCTCCCCTGA 1320 

TCCAGACAAG TGGGAGAGCC CX3TGGGGGCA GGGGACCTGG AGCTGCCAGC ACCAAGCGTG 1380 

ATTCCTGCTG CCTGTATTCT CTATTCCAAT AAAGCAGAGT TTGACACCGW MAAAAAAAAA 1440 

AAAAAAAAAA AACN 1454
(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

ACCCTGGCAT GCCCCACAAA CAGATCACCA GCCAGCTTAC ACAGGCATTA ACTCTCCTCA 60

ATGAGGAAGA ATCATTCACA ACTGAGCAAG ACATTCATAT GATCATTTAA GGAAGTGTTT 120

CCCTTATGTG TTAGCAAGTA TAATCGGCTA ACTCCTAAAT CCCAATGAAT AGTCCTAGGC 180

TGGACAGCAA TQGGCTGCAA TTAGGCAGAT AAAGACATCA GTCCCAGTAA ATGAATCCAT 240

AGACTCATCT AGCACCAACT ACCATTAGCA CTATGTTAGG AGCTGCAAGG CCCCAAAGTA 300

GAAGATGTGC ATAATGTCTG CTCTTGTGTA GCTCAGGAGA CAATTCCAGC ACAGACACTA 360

CAGTTAACGC TGAACTGCAG CTGCAAGTAA TAGCAWGAAC AGTCAGAAAA ATACCTTATG 420 

AGGGGGCAGG GCTGAAGCTG GGCCTTGAAG GATGGATGAA ATTTGGATAG AGAATGAGGA 480 

AGACAGAGGG NCTCCAAGTG AGAGAAGCAT GAAAAATGAG CARGGGCCTG GATCAGTGGG 540 

GTGTATTCAG AGCACCTYTC CAGATGCACC ATGCATGCTC ACAGTCCCTT GCCTATGTGT 600 

GGCAGAGTGT CCCAGCCAGA TGTGTGCCCC CACCXXZATGT CCATTTACAT GTCCTTCAAT 660 

GCCCACCTCA AAAGGYACYT CTTCTGTAAA GCTTTCCCTK GGTATCAGGA ATCAAAATTA 720 

ATCAGGGATC TTTTCACACT GCTGTTTTTT CCTCTTTGGT CCTTCTATCA CTAAAACTCA 780

TCTCATTCAG CCTTACAGCA TAACTAATTA TTTGTTTTCC TCACTACA 828 




(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2416 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

TCAATTTCCA TTAACTCAGA TCAGCCATTG TGATTCACCA TTTGrCAGGC TCTCAGGTTT 60 

AACAAAACCT ACTATCACCA TCATCCTTCA ACAGCCACAG TCTGAATTGA GCCAACATTT 120 

'l-riTX TC ITr GAGAAAGAAG TGGACTGGGG CACAACTTTT AGTCTGAGOG GAGCTAGTGG 180 



AAATCTAGAC AATAGAAGTC ATCGATAGCA GCTTTTCCTC AAATGTGTGA CTCCTCAGGG 240 

GCTAAACTGC TCTTAGCTTA GAATTATGCT TTACTAGAGA TCTAGCAGAT AAGTGGGTTA 300

ATCACTACCA TCCTGTAACT AGTTATATAG CTTCCAGACA TGAGGGAGAC ATCAAACAGG 360 

GATGGAAGCA ACCCCAAGGA TATGCAAGAA GGGCATGATG AACCCCCTTC CCTCTGGCAG 420 

GAGAACAAGG CCAACCAAGG GACAGACTGG AAAGCACTTA GATGTTTAAG GAGGAGAAAG 480 

GGGAAGCTTT GACCAGTCCT TGCCTTTTGC CAAGTTCAGC CAGTTCTCCG CTGCTTGCAA 540 

CCTCTAGCGC AGTAACATTT GCAGAATTGC AGATTTTCCC CCAGATACTA GGAGGAAAGG 600

GACTTTGGGG GGTGGGGAAG GGGTCGTGGT GTTTTAAAAG CATAAGTTAC CTGTTTGCAC 660 

TGTTTTAAGA TAGGAAAAAA AAATAGTGGG CAAGGTGAAC ATCAGACGTA AATTTGTGTG 720 

TTTTTATTTT GTCATGCTCT TGAA7ATGTT TGACCATTTG TAGTATACAC AGTGAAACTT 780 

GATTCTCTGT TGCATAAAAC ACTATATTTT TTTGGAAATG TTACTGTCCA AAAGCCTCTT 840 

CCCTCCCTTT CCTTTTCCTA TGTACTTCCT TCATACTTGC TTTACTGATC AGCCAGGCAA 900

TAGCCATCCA AGAGCTAGAG CATGAAACAG GGCCCTTTCC AAGTAGGCTC TGGGTGTCCT 960

AAGCCAGCGT GTGCCCTCTG GTTTAGTGAG TGTAATAGAG TCCCTGGCAC CTTTCTTTGC 1020 

AAATGAGGCT AACAGACCAG ACTGCAGCAA GTTATCAGAT TCCTCAATCA GATGCACTAG 1080 

GAGTGAGGAG CCXIAGGGATG GAGGGGGTTC CTGAAGTATT GCAGTTGGCT GTAGTAGCTG 1140 

AGTTCTTTTC CATGTTACCG AAACTGTAGC CAGTTACAGT TTACTCAGGA AAACQGTAGA 1200

TCAATTCAGC CATGGTAGTG CTGGTTGGCA GGGATTQGTA ACGGAGAGAA CTGCTCATCA 1260 

GCCAAAACTC AAGCCTTGCC TTTTAGGAGG CCACCAGCAG AGGGACTTQG TCCTCCTTGT 1320 

CTGGTACTTG TGTACATGCX: GGTGACCTGA GGACTCCACT CACACTGGCX5 AGCAAAAAGG 1380 

GAGCAGTGAT TCTCTTTTCT CTCCCCACCC CCTGCCCTTT GTTACCAACA CXrAGTTTCCC 1440 

AGGGGGTACA TGAGTTTCTG AATTTTTAAA AAATGTTTTT GGTTTGGTTT TTCTGGGGAC 1500

TGATAAGTGC TTTAAGCAAT GTCCATACCC CGTCAAGACT CCCAGCTTAG TCATTTTCTT 1560 

GTAnTTTCT GTrCACAGTA TTTGTGTGTG TGCTTGTTTT GGCAGCTCAT TTTGGCTGTA 1620 

TTATATATTG AGTGATGAAT TGATCCTCTT TTTTCCCTAA GGGATATGAA TTGTTTTTCT 1680 

TGTGTTATAT TCTGCTTGTG AATAGCTGGA GCAAACCTGG GGCTGACACX3 CGTAAGSTAG 1740 

GGCTGCAAAR CGAGAAGAGA GCCGGTGGAG TGTACTTGTC CCTGACAGGC TGACCTACCT 1800

GAGTCTCTGA GCTTTTCAGT CCAAATCTTT GCAAGGCTCA AAATGCCACA GAACCTCTCC 1860 

TCTTCTCCCC ACTCCCCATG GCAGGGACCG GACCATCCCT ACATCCAACA TGCTGTTCCT 1920 

CCAGCCCCTC CXATTGCCAT J3GCAAAACAG- GTAOriOTGG^ G CATTACATQG 1980

GATGCTTGTG TAATCGACCA CCTAGCCTTC TCTCTCCXXTT CCCGTCCTCC CCCAGAATCA 2040 

CTTCCTAGGA CACCCGAGCT GCTTGCCCAG QGTCCTGTTT CCCTGCTAAC TCCAGAGAAG 2100

CATCCCAGGG CTTTGTGACA GTCTCTAATT CCCTTCCCTT CTCGTTAAGA ATCATATTGT 2160 

ATAGTAGCTT TCAGACCATA CAGTATTCAT TGGGTTACTC CTATTATTAT CAAGTAGCTG 2220 

GAATTGTGAA GGTCGGAGTA GTTAGATCTT TAGCTTTTAT TCCTTATTTT TrTGTATTAC 2280 

TCTCCATGTG TATAAATTAT TGATCATGTT GCTGGCTTTT ATAAACTCTA AGCGAAGGAG 2340 

GAGCACTGCC TCAGCCTTTG CACATGGTAA TGAAGCACTG TTTTTAAATA AAAGRGRGAA 2400

MCHCCAAAAA AAAAAA 2416 




(2) INFORMATION FOR SEQ ID NO: 298: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 545 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

GAATTCGGCA CGAGCCATGC YTGGCCTCTC CTTGATTCTT ACAGTCACTT TGTTGGCTGT 60 

TTCTGACTCA GCAGCTACCT GCATTGTGGC CAAAGGATGA CCTATTCCTT CTCAGGAGGG 120 

CAAAAATGTG GAATAGTGTC TGTCCATGCC TCTCCTCATG GGCTACCACC TCTGCCACCG 180 

TGGTTAATCA GTAACAACCA GGAGAGAAGC TGCTGGAACT GACCTCTGGG AACTCCCTGG 240 

ATGGTTTGGT GCAGGAATGT AGTAGGCATA CACGTGGTTG CGTQGATCTG GGCCCTCCTG 300

ATGTGAGTAG AGAGGTAAAA GGSCACCATC TCCTTGACCT YTGGGGAACT CATCCACAAA 360 

GAAGATGTTT CCAAGATGCT TCTGAAGATT GSCTAAAAAT AGCCGGOTTC CACCCCCGTG 420 

AATGCATCCA TTCTAGAATG CTCCTTCACC AGGACCAGAG AACTGAOTTA CAGAAGTGAC 480 

ATGAAAACAT TCCATCCCAG AATTTGCANT ACCTCAAATT NAATTTCTAC CTATTAAAAA 540 

NAAAA 545



(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1530 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

GGCTCTGCTG GGCATCATAC TTGTCACTGG GTAAACAGTT TGCCCACTTA CCGCAGATGA 60

AGCTGCTTGC CAGGGCTCTC CGGCTCTGTG AGTTTGGGAG GCAGGCATCT TCCAGGAGGC 120 

TGGTGGCTGG CCAGGGATGT GTGGGGCCCC GGCGAGGGTG CTGCGCTCCC GTCCAGGTGG 180 

TTGGGCCCAG GGCTGATCTC CCACCCTGTG GAGCCTGCAT TACTGGAAGG ATCATGCGGC 240 

CAGATGATGC CAACGTGGCC GGCAATGTCC ACGGGGQGAC CATCCTGAAG ATGATCGAGG 300 

AGGCAGGCGC CATCATCAGC ACCCGGCATT GCAACAGCCA GAACGGGGAG CGCTGTGTGG 360

CCGCCCTGGC TCGTGTCGAG CGCACCGACT TCCTGTCTCC CATGTGCATC GGTGAGGTGG 420 

CGCATGTCAG CGCQGAGATC ACCTACACCT CCAAGCACTC TGTGGAGGTG CAGGTCAACG 480 

TGATGTCCGA AAACATCCTC ACAGGTGCCA AAAAGCTGAC CAATAAGGCC ACCCTGTGGT 540 

ATGTGCCCCT GTCGCTGAAG AATGTGGACA AGGTCCTCGA GGTGCCTCCT GTTGTGTATT 600 

CCCGGCANGA GCAGGAGGAG GAGGGCCGGA AGCGGTATGA AGCCCAGAAG CTGGAGOGCA 660

TGGAGACCAA GTGGAGGAAC GGGGACATCG TCCAGCCAGT CCTCAACCCA GAGCXZGAACA 720

CTGTCAGCTA CAGCCAGTCC AGCTTGATCC ACCTGGTGGG GCCTTCAGAC TGCACCCTGC 780 

ACGGCTTTGT GCACGGAGGT GTGACCATGA AGCTCATGGA TGAGGTCX5CC GGGATCGTGG 840 

CTGCACGCCA CTGCAAGACC AACATCGTCA CAGCTTCCGT GGACGCCATT AATTTTCATG 900 

ACAAGATCAG AAAAGGCTGC GTCATCACCA TCTCGGGACG CATGACCTTC ACGAGCAATA 960

AGTCCATGGA GATCGAGGTG TTGGTGGACG CCGACCCTGT TGTGGACAGC TCTCAGAAGC 1020 

GCTACCGGGC CGCCAGTGCC TTCTTCACCT ACGTGTCGCT GAGCCAGGAA GGCAGGTCGC 1080 

TGCCTGTGCC CCAGCTGGTG CCCGAGACCG AGGACGAGAA GAAOCGCTTT GAGGAAGGCA 1140 

AAGGGCGGTA CCTGCAGATG AAGGCGAAGC GACAGGGCCA CGCGGAGCCT CAGCCCTAGA 1200 

CTCCCTCCTC CTGCCACTGG TGCCTCGAGT AGCCATGGCA ACGGGCCCAG TGTCCAGTCA 1260

CTTAGAAGTT CCX:CCCTTGG CCAAAAACCC AATTCACATT GAGAGCTGGT GTTGTCTGAA 1320 

GTTTTCGTAT CACAGTGTTA ACCTGTACTC TCTCCTGCAA ACCTACACAC CAAAGCTTTA 1380 

TTTATATCAT TCCAGTATCA ATGCTACACA GTGTTGTCCC GAGCGCCGGG AGGCGTTGGG 1440 

CAGAAACCCT CGGGAATGCT TCCGAGCACG CTGTAGGGTA TGGGAAGAAC CCAGCACCAC 1500 

TMATAAAGCT GOTGCTTGGC TGGGGAAGNA 1530



(2) INFORMATION FOR SEQ ID NO: 300:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AGGTAGTGAG AGACACATTA CACCTAACCA ACAAGAAGAA GGATCCTCCC CCTTATAATT 60

TAACTATGTT TACAGGGAAT GCGTACATTG TGGCTTCCCG AGNATTTCGT CCAACATGTT 120 

TTGAAGAACC CTAAATCCCA ACAACTGATT GAATGGGTAA AAGACACTTA TAGCCCAGAT 180 

GAACACCTCT GGGCCACCCT TCAGCGTGCA CGGTGGATGC CTGGCTCTGT TCCCAACCAC 240 

CCCAAGTACG ACATCTTCAG ACATGACTTC TATTGCCAGG CTGGTCAAGT GGCAGGGTCA 300 

TGAGGGAGAC ATCGATAAGG GTGCTCCTTA TGCTCCCTGC TCTGGAATCC ACCAGCQGGC 360

TATCTCCGTT TATGGGGCTG GGGACTTGAA TTGGATGCTT CAAAACCATC ACCTGTTGGC 420 

CAACAAGTTT GACCCAAAGG TAGATGATAA TGCTCTTCAG TGCTTAGAAG AATACCTACG 480 
TTATAAGGCC ATCTATGGGA CTGAACTTTG AGACACACTA TGAGAGCGTT GCTACCTGTG 540

GGGCAAGAGC ATGTACAAAC ATGCTCAGAA CTTGCTGGGA CAGTGTGGGT GGGAGACCAG 600 

GGCTTTGCAA TTCGTGGCAT CCTTTAGGAT AAGAGGGCTG WTATTAGATT GTGGGTAAGT 660

AGATCTTTTG CCTTGCAAAT TGCTGCCTGG GTGRATGCTG CTTGTTCTCT CACCCCTAAC 720 

CCTAGTAGTT CCTCCACTAA CTTTCTCACT AAGTGAGAAT GAGAACTGCT GTGATAGGGA 780 

GAGTGAAGGA GGGATATGTG GTAGAGCACT TGATTTCAGT TGAATGCCTG CTGGTAGCTT 840 

TTCCATTCTG TGGAGCTGCC GTTCCTAATA ATTCCAGGTT TGGTAGCGTG GAGGAGAACT 900 

TTGATGGAAA GAGAACCTTC CCTTCTGTAC TGTTAACTTA AAAATAAATA GCTCCTGATT 960

CAAAGTAAGG AAAAARAAAA AAAGAAAAAA AACTCGA 997 




(2) INFORMATION FOR SEQ ID NO: 301: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2345 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

TTGAGGCCGA CGCTAGGGGC CCGGAAGRAA ACTGCGAGGC GAAGGTGACC GGGGACCX5AG 60 

CATTTCAGAT CTGCTCGGTA GACCTGGTGC ACCACCACCA TGTTGGCTGC AAGGCTQGTG 120 

TGTCTCCGGA CACTACCTTC TAGGGTTTTC CACCCAGCTT TCACCAAGGC CTCCCCTGTT 180 

GTGAAGAATT CCATCACGAA GAATCAATGG CTGTTAACAC CTAGCAGGGA ATATGCCACC 240 

AAAACAAGAA TTGGGATCCG GCGTGGGAGA ACTGGCCAAG AACTCAAAGA GGCAGCATTG 300

GAACCATCGA TGGAAAAAAT ATTTAAAATT GATCAGATGG GAAGATGGTT TGTTGCTGGA 360 

GGGGCTGCTG TTGGTCTTGG AGCATTGTGC TACTATGGCT TGGGACTGTC TAATGAGATT 420 

GGAGCTATTG AAAAGGCTGT AATTTGGCCT CAGTATGTCA AGGATAGAAT TCATTCCACC 480 

TATATGTACT TAGCAGGGAG TATTGGTTTA ACAGCTTTGT CTGCCATAGC AATCAGCAGA 540 

ACGCCTGTTC TCATGAACTT CATGATGAGA GGCTCTTGGG TGACAATTGG TGTGACCTTT 600

GCAGCCATGG TTGGAGCTGG AATGCTGGTA CGATCAATAC CATATGACCA GAGCCCAGGC 660 

CCAAAGCATC TTGCTTGGTT GCTACATTCT GGTGTGATGG GTGCAGTGGT GGCTCCTCTG 720 

ACAATATTAG GGGGTCCTCT TCTCATCAGA GCTGCATGGT ACACAGCTGG CATTGTGGGA 780 

GGCCTCTCCA CTGTGGCCAT GTGTGCGCCC AGTGAAAAGT TTCTGAACAT GGGTGCACCC 840 

CTGGGAGTGG GCCTGGGTCT CGTCTTTGTG TCCTCATTGG GATCTATGTT TCTTCCACCT 900

ACCACCGTGG CTX3GTGCCAC TCrTTACTCA GTGGCAATGT ACGGTGGATT AGTTCTTTTC 960

AGCATGTTCC TTCTGTATGA TACCCAGAAA GTAATCAAGC GTGCAGAAGT ATCACCAATG 1020

TATGGAGTTC AAAAATATGA TCCCATTAAC TCGATGCTGA GTATCTACAT GGATACATTA 1080

AATATATTTA TGCGAGTTGC AACTATGCTG GCAACTGGAG GCAACAGAAA GAAATGAAGT 1140

GACTCAGCTT C1GGCTTCTC TGCTACATCA AATATCITGT TTAATGGGGC AGATATGCAT 1200

TAAATAGTTT GTACAAGCAG CrTTCGTTGA AGTTTAGAAG ATAAGAAACA TGTCATCATA 1260

•nTAAATCTT CCGGTAATGT GATGCCTCAG GTCTGCCTTT TTTTCTGGAG AATAAATGCA 1320

GTAATCCTCT CCCAAATAAG CACACACATT TTCAATTCTC ATGTTTGAGT GATTTTAAAA 1380

TGTTTTGGTG T^TGTGAAAA CTAAAGTTTG TGTCATGAGA ATGTAAGTCT TTTTTCTACT 1440

TTAAAATTTA GTAGGTTCAC TGAGTAACTA AAATTTAGCA AACCTGTGTT TGCATATTTT 1500

TITGGAGTGC AGAATATTGT AATTAATGTC ATAAGTGATT TGGAGCTTTG GTAAAGGGAC 1560

CAGAGAGAAG GAGTCACCTG CAGTCTTTTG TTTTTTTAAA TACTTAGAAC TTAGCACTTG 1620

TCTTATTGAT TAGTGAGGAG CCAGTAAGAA ACATCTGGGT ATTTGGAAAC AAGTGGTCAT 1680

TGTTACATTC ATCTGCTGAA CTTAACAAAA CTGTTCATCC TGAAACAGGC ACAGGTGATG 1740

CATTCTCCTG CTGTTGCTTC TCAGTGCTCT CTTTCCAATA TAGATGTGGT CATGTTTGAC 1800

TTGTACAGAA TCrTAATCAT ACAGAGAATC CTTGATQGAA TTATATATGT GTGTnTACT 1860

TTTGAATGTT ACAAAAGGAA ATAACTTTAA AACTATTCTC AAGAGAAAAT ATTCAAAGCA 1920

TGAAATATGT TGCTTTTOC AGAATAC;^ CAGTATACTC ATCATTGCTA AGTGTTTTTT 1980

TATTTTTGCA TATTTATTGA ACTGTCTAAT TGAATACAGC TTGCTCTTGT CACCTCTTCA 2040

AGCTTTCAAG CCTTTATAGA AAAGCTTCTT TGTGGCTTAC ACTGGAAATT ATGAAAGCAG 2100

TTTITCTCCT AAGACTTTTG GTTTCTCGCA TTGCCTCTCA GACTAAGCAC TAAAAAGCAA 2160

AGCAAAACAG AACTAGTNCT GTCTTAATGA AATATATCAA CXCA/^AAGTG TAATGAGGAA 2220

AATGCTTCAT TAGTTTCCCC TAGCAGACTT ITACTTCTCT TACACTGCTA CACCATTACT 2280

TTCITGAGAC ATTIOTAAGT CCTTTGATAC AGAAGAGTTA TATTTAGGAG GNCTTTAATG 2340

AAGGG 2345

(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2369 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:

v i ' n ' rTvriT u ' X ' i ' i ' i ' ri ' rri ' tttttncaag atcattgttt atttattact tcagataaaa 60

AGATAGTATA CATATTAGGG AATCCCTTAA AATTCAACTC TAGAGTTATA CACCATCTAG 120 

TACTTTTGCA ATGAATGTTA ACAACAACAA AAAAAATCTC TAAACACCTG AAAGCCCCAC 180 

TATTAACATG GACTATGGTA ATAAAAAATT TTGACATTTA ATTTGTTCAA CATATAGTAT 240 

TTACATTATG AAACCAATGG TGATGATACA ATAAAGTGAT AAAGAAATAG TAAAAATAAA 300 

CTTTAAAAAG CAAAGGTTTA TAGTCTGACA ATGCTAATTA TCCTAATTGT ATATAAAAAA 360

TTAAAACATA GAGCTTTCTG TTACAAAATT CTTAATCCTC TGGGTTGTAA TCATTACTTG 420 

CTACCAATTT ACATGCAACA TCTGCTAGGA CTGACATTTG ATTTTTTTCC CCAAGAATGT 480 

GTGAGTAGAT AAATGACATT TCAGAGCAGA TATTAATTTA CTTGTGGACA GAAAAAGAAA 540 

CTCAAGATTG GTACTGGTCA CAAGCCTCTT CCCAATAGAA ATTATAAAAA CAGTAAGATA 600 

AAATTTAAAA AAAATCTAAA AAGGGGATGC ATAGGCAAAG AGTACCATAA ATGGCACAGC 660

TCAAAAAATC CCAGGACCAA TCAGACACAC ATCTTTTCTC TCTCCTTCAG CGACAAGAGG 720 

tcgahttgc CATCAAATAA CCATGATTGA AGCAAGCGAG GGGCACCAGG TGTACAACTG 780 

ATTAGATCTT GCAAAATACT AAGATGGGAG CAGGGGTGGC CAGAAGAAGG GGTAATTTAT 840 

ATATAATTCA AACTATATAC AGCATAAATG GAATGCAGCC CATCCCAAAC TGGCTCTGrG 900 

AAACAATTGG ACCTTTATAG TTAAAATTAT AACAAGTGTA ATAATACAAT AGATTTACAT 960

GGGAAGCAAA ATCOWVGGGA CATTIT'A TASGTATMA CTGTGCTGTT 1020

AATAATTTTG CTAAGTATAC ATCTCAACTG AAGTCTATGT AAAAAATGTC CTAATAGATA 1080 

CAGATATTTA CCTTTGGTGA GTTGAAGGCC TTTTTGTGAC TTCTGTCTGA ACTGTAGGCA 1140 

GAAIGCTAGA TGTACATGCA CATATGGAGA AACTCAAGCT GAGGTCATCC AAAAGCTGTG 1200 

CGTATGAGGA GGCTGGAGGT ACTTTGAAAG TCAAAGTAGA CCAGAAACCC AAAACAGGTA 1260

ACAGTGAGGA TGGCAACAGG GAATGGAATG CCAATATGGC AGTAAAACTT TTTTTAAAAA 1320 

CAGAAAGAGG AAGGCCTCTC GTACCAGCAG AATCCTGrTAC ACGTACAAAA AAGAAAAAGC 1380 

CACCCACCAT TTTGTAAAAC AGAAGCCAAT TATAGTGTGG GAAAGTACAA ATTACAGAAA 1440 

ACCAGAAGTC AACAGAAGAA AAACTACTGG TTTACTTGAG AGAAAGGAGA ATGGTTCACC 1500 

CCGAGCAGAG TTACTTGGTG AACGCCGCCA CCACCGCCCA CAGAACCTCA TTGGrGTTGG 1560

CCTTCAGACA TTCCACTTCA GGGTCTAAGT CGAGAARNTG CCGCACTCTC TTGGTAGCCA 1620 

AATCATACTG CTCGTCCT^GA AGAGGAGCAA AAGCATTCTC CAGGACGTCC GAGGCATGAG 1680 

CCAGGTAAAT GAGGGCCAGC AAGCGCCTGT CCATGCGGTG AGGGTCATTC ACCGATTTGr 1740

CAAGAACGGC TTCCTGTACT TTCTTGATGA GGCGCTGCTT AATGTTGTTA TTGGTGAGGG 1800

GATGTGTTGT CATGTCAAAA AGTAGGAAGT TCTGTTTCTC TCTTGTCAAT ACACCCTTTT 1860

CCACCAGGTT TTTAGCTAAT CGrrTCCCX5TA CATTTCTTAA CTGATAATGC AATTTTAATG 1920

GATTCCATGT CTCACCACTA AGTAATTCAA TCCAGTTCTG GACCGTTTCT GGAGGCTGAG 1980

TTTCCTTAAC ATGCTTCAGA GCTTCATCAA GAAGAACATC CCCTGTTGGA GCATCTGACT 2040

TACAGATTAC CTTTCTTGTT AATAGACTTT TACGTCTCAT TCCACAAGCC TCTAGTTGTA 2100

ACCTTCCTCT CAATGCTAAT TCAATTAACA TACAGCCACG TAATCCAGAT GATATACAGT 2160

CATTCCAAAA TGATGTGTAA ACCTTCGCGG TCCTTGAGGC CCAGCAGGAG CACTTCCTCC 2220

ATCAGGGTCA GCCGCGTTTC CTTGGAGTCG CCCTTGTCGT CX5TCGTCCTG CTCGTCGCGG 2280

CGGCTCTOCG CGTCGTCCTC GCTGCTAGCC GCGCCGCCGC CCGCCGCCCG CTCCTTGTCG 2340

GCGGCGTTGC GQGAGGCCTC GGTGCGCCG 2369


(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



GGGACGTGTG GTTTCAGCTC GTGCGCCTCC CCGTGQGTTT GCGACGTTTA GCGACTATTG 60 

CGCCTGCGCC ACGCCGGCTG CGAGACTGGG GCCGTGGYTG CTGGTCCCGG GTGATGCTAG 120

GCGGCTCCCT GGGCTCCAGG CTGTTGCGGG GTGTAGGTGG GAGTCACGGA CGGTTCGGGG 180 

CCCGAGGTGT CCGCGAAGGT GGCGCACATG GGCQGCAGGG GAGAGCATGG CTCAGCGGAT 240 

GGTCTGGGTG GACCTGGAGA TGACAGGATT GGACATTGAG AAGGACCAGA TTATTGAGAT 300 

GGCCTGTCTG ATAACTGACT CTGATCTCAA CATTTTGGCT GAAGGTCCTA ACCTGATTAT 360 

AAAACAACCA GATGAGTTGC TGGACAGCAT GTCAGATTGG TGTAAQGAGC ATCACGGGAA 420

GTCTGGCCTT ACCAAGGCAG TGAAGGAGAG TACAATTACA TTGCAGCAGG CAGAGTATGA 480 

ATXTCTGTCC TTTGTACGAC AGCAGACTCC TCCAGGGCTC TGTCCACTTG CAGGAAATTC 540 

AGTTCATCAA GATAAGAAGT TTCTTGACAA ATACATGCCC CAGTTCATGA AACATCTTCA 600 

TTATAGAATA ATTGATGTGA GCACTGTTAA AGAACTGTGC AGACGCTGGT ATCCAGAAGA 660 

ATATGAATTT GCACCAAAGA AGGCTGCTTC TCATAGGGCA CTTGATGACA TTAGrGAAAG 720



CATCAAAGAG CTTCAGTTTT ACCGAAATAA CATCTTCAAG AAAAAAATAG ATGAAAAGAA 780 

GAGGAAAATT ATAGAAAATG GGGAAAATGA GAAGACCGTG AGTTGATGCC TVCyiTATCATG 840 

CTGCCACTAC ATCGTTATCT GGAGGCAACT TCTGGTGGTT TTTTTTTCTC ACGCTGATGG 900 

CTTGGCAGAG CMCTTCGGTT AACTTGCATC TCCAGATTGA TTACTCAAGC AGACAGCACA 960 

CGAAATACTA TTTTTCTCCT AATATGCTGT TTCCATTATG ACACAGCAGC TCCTTTGTAA 1020 

GTACCAQGTC ATGTCCATCC CTTGGTACAT ATATGCATTT GCTTTTAAAC CATTTCTTTT 1080 

GTTTAAATAA ATAAATAAGT AAATAAAGCT AGTTCTATTG AAATGCAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA N 1181 




(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

CTTTTTGTGT TCCGGCCGAT CCCACCTCTC CTCGACCCTG GACGTCTACC TTCCGGAGGC 60

CCACATCTTG CCCACTCCGC GCGCGGGGCT AGCGCGGGTT TCAGCGACGG GAGCCCTCAA 120

GGGACATGGC AACTACAGCG GCGCCGGCGG GCGQCGCCCG AANATGGAGC TGGCCCGGAA 180

TGGGGAGGGT TCGAAGAAAA CATCCAGGGC GGAGGCTCAG CTGTGATTGA CATGGAGAAC 240

ATGGATGATA CCTCAGGCTC TAGCTTCGAG GATATGGGTG AGCTGCATCA GCGCCTGCGC 300

GAGGAAGAAG TAGACGCTGA TGCAGCTGAT GCAGCTGCTG CTGAAGAGGA GGATGGAGAG 360

TTCCTGGGCA TGAAGGGCTT TAAGGGACAG CTGAGCCGGC AGGTGGCAGA TCAGATGTGG 420

CAGGCTGGGA AAAGACAAGC CTCCAGGGCC TTCAGCTTGT ACGCCAACAT CGACATCCTC 480

AGACCCTACT TTGATGTGGA GCCTGCTCAG GTGCGAACAG GGCTCCTGGA GTCCATGATC 540

CCTATCAAGA TQGTCAACTT CCCCCAGAAA ATTGCAQGTG AACTCTATGG ACCTCTCATG 600

CTGGTCTTCA CTCTGGTTGC TATCCTACTC CATGGGATGA AGACGTCTGA CACTATTATC 660

CGGGAGGGCA CCCTGATGGG CACAGCCATT GGCACCTGCT TCGGCTACTG GCTGGGAGTC 720

TCATCCTTCA TTTACTTCCT TGCCTACCTG TGCAACGCCC AGATCACCAT GCTGCAGATG 780

TTGGCACTGC TGGGCTATGG CCTCTTTGGG CATTGCATTG TCCTGTTCAT CACCTATAAT 840

ATCCACCTCC ACGCCCTCTT CTACCTCTTC TGGCTGTTGG TGGGTGGACT GTCCACACTG 900

CGCATGGTAG CAGTGTTGGT GTCTCGGACC GTGGGCCCCA CACAGCGGCT GCTCCTCTCTr 960

GGCACCCTGG CTGCCCTACA CATGCTCTTC CTGCTCTATC OXSCATTTTGC CTACCACAAA 1020

GTGNTAGAGG GGATCCTGGA CACACTGGAG GGCCCCAACA TCCCGCCCAT CCAGAGGGTC 1080

CCCAGAGACA TCCCTGCCAT GCTCCXTTGCT GCTCGGCTTC CCACCACCX3T CCTCAACGCC 1140

ACAGCCAAAG CTGTTGCGGT GACCCTGCAG TCACACTGAC CCCACCTGAA ATTCTTGGCC 1200

AGTCCTCTTT CCXTGCAGCTG CAGAGAGGAG GAAGACTATT AAAGGACAGT CCTGATGACA 1260

TGTTTCGTAG ATGGGGTTTG CAGCTGCCAC TGAGCTGTAG CTGCGTAAGT ACCTCCTTGN 1320

AGCTGTCGGC ACTTCTGAAA GCACAAGGCC AAGAACTCCT GGCCAGGACT GCAAGGCTCT 1380

GCAGCCAATG CAGAAAATGG GTCAGCTCCT TTGAGAACCC CTCCCCACCT ACCCCTTCCT 1440

TCCTCTTTAT CTCTCCCACA TTGTCTTGCT AAATATAGAC TTGGTAATTA AAAAAAAAAA 1500

AAAAAAAAAA AAAAAAAAAA AAAAAAGGGG GGNCCCC 1537



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1493 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

TGCATGCCAA AACCAMSfCC .TGCCftJACAA 60

TTATATTTCT ATTCACATCA TTATTGAAAA TACCCAGCTC AGTGCCTGGC TTAATAAATG 120 

TTTAATTCCC TTACCTACTC TTGCTCTATT TTTTTATTTG AAATGGAGAT GAGCAAAATA 180

ACACATTCAT GGCTGAAGCA ATTTTTTGGA CATTTCTTGT TACCAAAAGA TCTATAATCA 240 

GGATGATCCT GAGCTGTTCA AACAAGCTGT ATATAAACAG ACAATGAAAC TCTTTGCAGA 300 

GCTGGAAATT AAAAGGAAAG AGAGAGAAGC CAAAGAGATG CATGAAAGGA AACGACAAAG 360 

GGAAGAAGAG ATTGAAGCTC AAGAAAAAGC CAAACGGGAA AGAGAGTGGC AGAAAAACTT 420 

TGAGGAAAGT CGAGATGGTC GTGTGGACAG CTGGCGAAAC TTCCAAGCCA ATACGAAGGG 480

GAAGAAAGAG AAGAAAAATC GGACCTTCCT GAGACCACCG AAAGTAAAAA TGGAGCAACG 540 

TGAGTGACCG CCCAAGGTCA CAGGCACAGA ACCTTTCCCC TGCTATCTCC CTTCCTGCTT 600 

CGAAGGACTC ATTCTTTCCT CCCACTTCCA CCCCAACATA GAGTAGTATT TGCTTTTTAG 660 
TCCATTTTGT TTTCAATACG ATTTAATATC GATCAGAGTA ATTCTTTTGT ACATTGAAAT 720 
GAGGGGCITG GTTTAAAAAA AGACCTTTCC CTCTCCCTGC CCCTAGAACA ACCAGTATTA 780



GAAGGTGCCA CCATTGGTGC TGCCTTCTCT TCCCACAGCC TGTAACTCAG TGTTTTGTAC 840 

TTCACTGAAT TGTCATGGTT AGAAACTTCG TGGATAGTTT GTGGAAATCA TCCAATTAAA 900 

CATACTGCTT AAAACAGTGT TGCTGTGACT TCAGAGACAA GCCTGGAAGG OGCACCTTAG 960 

GAAGCCCCTT CGCTTCAGTT GCTTCGCTTCT GGGTGTGCTC CCTTCGAAGG CXTCAGATAAG 1020 

ACAGGGAACA CTTGTGAGCA CACAGAGCAG CATCTGATGC CCTGTGGTGT TTGGCATGTG 1080

CCCCCTGTCT ACTGACXy^T CAGTGTGGCA TGAGGCCCAC GCCACCCAAA CCTTTCACTT 1140 

TCCAAAGAGC TAGCCX3TCCT CCACCCAGTA CCATGTCCTA GCCTGTCTGC ATTTGTTAGT 1200 

GGTAATATTC TITATGTATA ATAAATTTTT ATACCCAAGC CATTGATGTA CTTTTCCTrTG 1260 

TACTCrCCCT TGTGGGTCCC TTGTCTGGCT TGGCTGAACC CCAAAATGCT TTGGGGTTGG 1320 

ACAGACCTGG CTGAACCTTA GTTTCTTCAT CTATGAAATG GGAATATGAA TTACTGCAGC 1380

AGCTTTTAGG GCAGATTTGC CATGGCATAT ACAAGGTAAC TACCATAGTG CTCCTTGGGT 1440 

ATTGCCAATA TCCTATTATT TCTGTGTAAA ATGAAGATAC TGATTGTTTT GAG 1493 


(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

AATTOGGCAG AGGNATTATA TACACTATAC TGGCATTTAC TGTTTCACCC AGCCCGGAAA 60 

GTCAGAGATG TATATTGGAA AATTTACAAC TCCATCTACA TTGGTTOCCA GGACGCTCTC 120 

ATAGCACATT ACCCAAGAAT CTACAACGAT GATAAGAACA CCTATATTCG TTATGAACTT 180 

GACTATATCT TATAATTTTA TTGTTTATTT TGTGTTTAAT GCACAGCTAC TTCACACCTT 240

AAACTTGCTT TGAnTGGTG ATGTAAACTT TTAAACATTG CAGATCAGTG TAGAACTGGT 300 

CATAGAGGAA GAGCTAGAAA TCCAGTAGCA TGATTTTTAA ATAACCTGTC TTTGTTTrTG 360 

ATGTTAAACA GTAAATGCCA GTAGTGACCA AGAACACAGT GATTATATAC ACTATACTGG 420 

AGGGATTTCA TTTTTAATTC ATCTTTATGA AGATTTAGAA CTCATTCCTT GTGTTTAAAG 480 

GGAATGTTTA ATTGAGAAAT AAACATTTGT GWACAAAATG YTAAAAAAAA AAAAAAAAAA 540

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACTCGA 577 



(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2860 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

GTGTNGACCG CTCTCNCAAT ATGGCTCCCC CGGGCTGGCA GRWRKTCRGT CWCKRGTGGC 60 

TAGCCTGTCC TGACAGGGGA GAGTTAAGCT CCCGTTCTCC ACCGTGCCGG CTGGCCAGGT 120 

GGGCTGAGGG TGACCGAGAG ACCAGAACCT GCTTGCTGGA GCTTAGTGCT CAGAGCTGGG 180 

GAGGGAGGTT CCGCCGCTCC TCTGCTGTCA GCGCCGGCAG CCCCTCCCGG CTTCACTTCC 240 

TCCCGCAGCC CCTGCTACTG AGAAGCTCCG GGATCCCAGC AGCCGCCACG CCCTGGCCTC 300

AGCCTGCGGG GCTTCCAGTC AGGCCAACAC CGACGCGCAC TGGGGAGGAA GACAGGACCC 360 

TTGACATCTC CATCTGCACA GAGGTCCTGG CTGGAACCGA GCAGCCTCCT CCTCCTAGGA 420 

TGACCTCACC CTCCAGCTCT CCAGTTTTCA GGTTGGAGAC ATTAGATGGA GGCCAAGAAG 480 

ATGGCTCTGA GGCGGACAGA GGAAAGCTGG ATTTTGGGAG CGGGCTGCCT CCCATGGAGT 540 

CACAGTTCCA GQGCGAGGAC CGGAAATTCG CCCCTTCAGA TAAGAGTCAA CCTCCAACTA 600

CCGAAAGGGA ACAGGTGCCA GTCAGCCGGA TCCAAACCGA TTTGACCGAG ATCGGCTCTT 660 

CAATGCGGTC TCCCGGGGTG TCCCCGAGGA TCTGGCTGGA CTTCCAGAGT ACCTGAGCAA 720 

GACCAGCAAG TACCTCACCG ACTTCGGAAA TACACAGAGG GCTCCACAGG ^TAAGAC^GTC 780

TGATGAAGGC TGTGCTGAAA CCTTAAGGAC GGGGTCAATG CCTGCATTCT GCC7CTGCTG 840 

CAGATCGACC GGGACTCTGG CAATCCTCAG CCCCTGGTAA ATGCCCAGTG CACAGATGAC 900

TATTACCGAG GCCACAGCGC TCTGCACATC GCCATTGAGA AAGAGGAGTC TGCAGTGTGT 960 

GAAGCTCCTG GTGGAGAATG GGGCCAATGT GCATGCCCGG GTCTGCGGCG ACTTCTTCCA 1020 

GAAGGGCCAA GGGACTTGCT TTTATTTCGG TGAGCTACCC CTCTCTTTGG CCGCTTGCAC 1080 

CAAGCAGTGG GATGTGGTAA GCTACCTCCT GGAGAACCCA CACCAGCCCG CCAGCCTGCA 1140 

GGCCACTGAC TCCCAQGGCA ACACAGTCCT GCATGCCCTA OTGGATGATC TCGGACAACT 1200

CAGCTGAGAA CATTGCACTG GTGACCAGCA TGTATGATGG GCTCCTCCAA GCTKGGGSCC 1260 

SCCYTCTGCC CTACCGTGCA GCTTGAGGAC ATCCGCAACC TGCAGGATCT CACGCCTCTG 1320 

AAGCTGGCCG CCAAGGAGGG CAAGATCGAG ATTTTCAGGC ACATCCTGCA GCGGGAGTTT 1380 

TCAGGACK3A GCCACCTTTC CCGAAAGTTC ACCGAGTGGT GCTATGGGCC TGTCCGGGTG 1440 

TCGCTGTATG ACCTGGCTTC TGTGGACAGC TGTGAGGAGA ACTCAGTGCT GGAGATCATT 1500

GCCTTTCATT GCAAGAGCCC GCACCGACAC CGAATGGTCG TTTTGGAGCC CCTGAACAAA 1560 

CTGCTGCAGG CGAAATGGGA TCTGCTCATC CCCAAGTTCT TCTTAAACTT CCTGTGTAAT 1620 

CTGATCTACA TGTTCATCTT CACCGCTGTT GCCTACCATC AGCCTACCCT GAAGAAGCAG 1680 

GCCGCCCCTC ACCTGAAAGC GGAGGTTGGA AACTCCATGC TGCTGACX3GG CXIACATCCTT 1740 

ATCCTGCTAG GGGGGATCTA CCTCCTCGTG GGGCCAGCTG TGGTACTTCT GGCGGCGCCA 1800 

CGTGITCATC TGGATCTCGT TCATAGACAG CTACTTTGGA AATCCTCITC CTGTTCCAGG 1860 

CCCTGCTTCA CAGTGGTGTC CXAGGTGCTG TGTTTCCTGG GCCATCGAGT GGTACCTGCC 1920 

CCTGCTTGTG TCTGCGCTGG TGGCTGGGCT GGCTGAACCT GCTTTACTAA TACACGTGGC 1980 

GTTCCAGCAC ACAGGCAGTC TACAGTTTCA TGWTCCCTGA AGCCXTTGCTG AGCCTGAGCC 2040 

AGGAGGCTTG C5CGCCCCGAA GCTCCTACAG GCCCCAATGC CACAGAGTCA GTGCAGCXXA 2100 

TQGAQGGACA GGAGGACGAG GGCAACGGGG CTCAGTACAG GGGTATCCTC3 GAAGCCTCCT 2160 

TGGAGCTCTT CAAATTCACC ATCGGCATGG GCGAGCTGGC CTTCCAGGAG CAGCTGCACT 2220 

TCCGCGGCAT GGTGCTGCTG CTGCTGCTGG CCTACGTGCT GCTCACCTAC ATCCTGCTGC 2280 

TCAACATGCT CATCGCCCTC ATGAAGCGAA CGTCACAGTG TCGCCACTGA CAGCTGGAGC 2340 

ATCTGGAAGC TGCAGAAAGC CATCTCTGTC CTGGAGATGG AGAATGGCTA TTGGTGGTGC 2400 

AGGAAAAAGC AGCGGGCAGG TGTGATGCTG ACCGTTGGCA CTAAGCCCAG ATGGCAGCCC 2460 

CGATGAGCGC TGGTGCTTCA GGGTGGAGGA GGTGAACTGG GCTTCATGGG GAGCAGACGC 2520 

TGCCTACGCT GTGTGAGGAC CCX3TCAGGGG CAGGTGTCCC TCGAACTCTC GAGAACCCTG 2580 

TCCTGGCTTC CCCTCCCAAG GAGGATGAGG ATGGTGCCTC TGAGGAAAAC TATGTGCTCG 2640 

TCCAGCTCCT CCAGTCCAAC TGATGGCCCA GATGCAGCAG GAGGCCAGAG GACAGAGCAG 2700 

AGGATCTTTC CAACCACATC TGCTGGCTCT GGGGTCCXZAG TGAATTCTQG TGGCAAATAT 2760 

ATATTTTCAC TAACTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAVGAGGG GGGGCCCGKT 2820 

ASCCAAWTTC GCCCTATAAG OXSAGTGCCWA TTACGATAAA 2860 



(2) INFORMATION FOR SEQ ID NO: 308:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

CTGCTTGTGT CTGCGCTGGT GCTGGGCTGG CTGAACCTGC TTTACTATAC ACXnX3GCTTC 60

CAGCACACAG GCATCTACAG TGTCATGATC CAGAAGCCCT GGTGAGCCTG AGCCAGGANN 120

TTGGCGCXrCC GAAGCTCCTA CAGGCCCCAA TGCCACAGAG TCAGTGCAGC CXATGGAGGG 180

ACAGGAGGAC GAGGGCAACG GGGCCCAGTA CAQGGGTATC CTGGAAGCCT CCTTGGAGCT 240

CTTCAAATTC ACCATCGGCA TGGGCGAGCT GGCCTTCCAG GAGCAGCTGC ACTTCCGCGG 300

CATGGTGCTG CTGCTGCTGC TGGCCTACGT GCTGCTCACC TACATCCTGC TGCTCAACAT 360

GCTCATCGCC CTCATGNAGC GAGACCGWCA ACAGTGTCGC CACTGACAGC TGGAGCATCT 420

GGAAGCTGCA GAAAGCCATC TCTGTCCTGG AGATGGAGAA TGGCTATTGG TGGTGCAGGA 480

AGAAGC7\GCG GGCAGGTGTG ATGCTGACCG TTGGCACTAA GCCAGATGGC AGCCCCGATG 540

AGCGCTGGTG CTTCAGGGTG GAGGAGGTGA ACTGGGCTTC ATGGGAGCAG ACGCTGCCTA 600

CGCTGTGTGA GGACXXX3TCA GGQGCAGGTG TCCCTCGAAC TCTCGAGAAC CCTGTCCTGG 660

CTTCCCCTCC CAAGGAGGAT GAGGATGGTG CCTCTGAGGA AAACTATGTG CTCGTCCAGC 720

TCCTCCAGTC CAACTGATGG CCCAGATGCA GCAGGAGGCC AGAGGACAGA GCAGAGGATC 780

TTTCCAACCA CATCTGCTGG CTCTGGGGTC CCAGTGAATT CTGGTGGCAA ATATATATTT 840

TCACTAAMWM AAAAAAAAAA AAAAAAAAAA ACTCGA 876





(2) INFORMATION FOR SEQ ID NO: 309: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

CATGACCCGC CTGATGCGAT CCCGCACAGC CTCTGGTTCC AGCGTCACTT CTCTGGATGG 60 

CACCCGCAGC CGCTCCCACA CCAGCGAGGG CACCCGAAGC CGCTCCCACA CCAGCGAGGG 120 

CACCCGCAGC CGCTCGCACA CCAGCGAGGG GGCCCACCTG GACATCACCC CCAACTCGGG 180 

TOCTGCTGGG AACASGCCGG GCCCAAGTCC ATGGAGGTCT CCTGCTAGGC GGCCTGCCCA 240

GCTGCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC CTCCCCGGCC CCTTTTCGCC 300 

CCCTGCCTGC CATACTGCGC CTAACTCGGT ATTAATOCAA AGCTTATTTT GTAAGAGTGA 360 

GCTCTGGTCG AGACAAATGA GGTCTATTAC GTGGGTGCCC TCTCCAAAGG CGGQGTGGCG 420 

GTCGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC CTCTTCCATT AACCAGTGGC 480 

CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT GCCAAAAACA AGACGCGTAC 540

AGCACACACT TCACAAAGCC AAGCCTAGGC CGCCCTGAGC ATCCTGGTTC AAACGGGTGC 600

CTGGTCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT AACTGAGGAG AAGCTGATCC 660 

AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG GGTAATAGTG ACATGCAGGC 720 

ACCTCTTTTA AACAGGCAAA ACAGGAAGGG GGAAAAGGTG GGATTCATGT CGAGGCTAGA 780 

GGCATITGGA ACAACAAATC TACGTAGTTA ACTTGAAGAA ACCGATTTTT AAAGTTGGTG 840

CATCTAGAAA GCTTTGAATG CAGAAGCAAA CAAGCTTGAT TTTTCTAGCA TCCTCTTAAT 900 

GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG ACAAAAATAT TTCAGCAAAC 960 

GTTGGGCATC ATGGTTTTTG AAGGCTTTAG TTCTGCTTTC TGCCTCTCCT CCACAGCXTCC 1020 

AACCTCCCAC CCCTGATACA TGAGCCAGTG ATTATTCTTG TTCAGGGAGA AGATCATTTA 1080 

GATTTGTTTT GCATTCCTTA GAATGGAGGG CAACATTCCA CAGCTGCCCT GGCTGTGATG 1140

AGTGTCCTTG CAGGGGCCGG AGTAGGAGCA CTGGGGTGGG GGCGGAATTG GGGTTACTCG 1200 

ATGTAAGGGA TTCCTTGTTG TTGTGTTGAG ATCCAGTGCA GTTGTGATTT CTGTGGATCC 1260 

CAGCrrGGTT CCAGGAATTT TGTGTGATTG GCTTAAATCC AGTTTTCAAT CTTCGACAGC 1320 

TGGGCTGGAA CGTGAACTCA GTAGCTGAAC CTGTCTGACC CGGTCACX5TT CTTGGATCCT 1380 

CAGAACTCTT TGCTCTTCTC GGGGTGGGGG TGGGAACTCA CX3TQGGGAGC GGTGGCTGAG 1440

AAAATGTAAG GATTCTGGAA TACATATTCC ATGGGACTTT CCTTCCCTCT CCTGCTTCCT 1500 

CTTTTCCTGC TCCCTAACCT TTCGCCGAAT GGGGCAGCAC CACTGACGTT TCTGGGCGGC 1560 

CAGTGCXSGCT GCCAGGOTCC TGTACTACTG CCTTGTACTT TTCATTTTGG CTCACCGTGG 1620 

ATTTTCTCAT AGGAAGTTTG GTCAGAGTGA ATTGAATATT GTAAGTCAGC CACTGGGACC 1680 

CGAGGATTTC TGGGACCCCG CAGTTGGGAG GAGGAAjsTAG TCCAGCCTTC CAGGTGGCGT 1740

GAGAQGCAAT GACTCGTTAC CTGCCGCCCA TCACCTTGGA GGCXTTTCCCT GGCXTTTGAGT 1800 

AGAAAAGTCG GQGATCGGGG CAAGAGAGGC TGAGTACGGA TGGGAAACTA TTGTGCACAA 1860 

GTCmCCAG AGGAGTTTCT TAATGAGATA ITTGTATrTA TTTCCAGACX: AATAAATTTG 1920 

TAACTTTGCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAACTC 1980 

GAGGGGQGCC CX5TACCCAAT TCGCCGTATA TGATCGTAAA CAATC 2025



(2) INFORMATION FOR SEQ ID NO: 310:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3026 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double



(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:

TAGGCAGCAC TGAAATATCC TAACCCCCTA AGCTCCAGGT GCCCTGTGGN ACGAGCAACT 60

GGACTATAGC AGGGCTGGGC TCTGTCTTCC TGGTCATAGG CTCACTCTTT CCCCCAAATC 120 

TTCCTCTGGA GCTTTGCAGC CAAGGTGCTA AAAGGAATAG GTAGGAGACC TCTTCTATCT 180 

AATCCTTAAA AGCATAATGT TGAACATTCA TTCAACAGCT GATGCCCTAT AACCCCTGCC 240 

TGGATTTCTT CCTATTAGGC TATAAGAAGT AGCAAGATCT TTACATAATT CAGAGTGGTT 300 

TCATTGCCTT CCTACCCTCT CTAATGGCCC CTCCATTTAT TTGACTAAAG CATCACACAG 360

TGGCACTAGC ATTATACCAA GAGTATGAGA AATACAGTGC TTTATGGCTC TAACATTACT 420 

GCCTTCAGTA TCAAGGCTGC CTGGAGAAAG GATGGCAGCC TCAGGGCTTC CTTATGTCCT 480 

CCACCACAAG AGCTCCTTGA TGAAGGTCAT CTTTTTCCCC TATCCTGTTC TTCCCCTCCC 540 

CGCTCCTAAT GGTACGTGGG TACCCAGGCT GGTTCTTCGG CTAGGTAGTG GGGACCAAGT 600 

TCATTACCTC CCTATCAGTT CTAGCATAGT AAACTACGGT ACCAGTGTTA GTOGGAAGAG 660

CTGGGTTTTC CTAGTATACC CACTGCATCC TACTCCTACC TGGTCAACCC GCTGCTTCCA 720 

GGTATGGGAC CTGCTAAGTG TGGAATTACC TGATAAGGGA GAQGGAAATA CAAGGAGGQC 780 

CTCTGGTGTT CCTGGCCTCA GCCAGCTGCC CACAAGCCAT AAACCAATAA AACAAGAATA 840 

CTGAGTCAGT TTTTTATCTG GGTTCTCTTC ATTCCCACTG CACTTGGTGC TGCTTTGGCT 900 

GACTGGGAAC ACCCCATAAC TACAGAGTCT GACAGGAAGA CTGGAGACTG TCCACTTCTA 960

GCTCGGAACT TACTGTGTAA ATAAACTTTC AGAACTGCTA CCATGAAGTG AAAATGCCAC 1020 

ATTTTGCTTT ATAATTTCTA CCCATGTTGG GAAAAACTGG CTTTTTCCCA GCCCTTTCCA 1080 

GGGCATAAAA CTCAACCCCT TCGATAGCAA GTCCCATCAG CCTATTATTT TTTTAAAGAA 1140 

AACTTGCACT TGTTTTTCTT TTTACAGTTA CTTCCTTCCT GCCCCAAAAT TATAAACTCT 1200 

AAGTGTAAAA AAAAGTCTTA ACAACAGCTT CTTGCTTGTA AAAATATGTA TTATACATCT 1260

GTATTTTTAA ATTCTGCTCC TGAAAAATGA CTGTCCCATT CTCCACTCAC TGCATTTQGG 1320 

GCCTTTCCCA TTGGTCTGCA TGTCTTTTAT CATTGCAGGC CAGTGGACAG AGGGAGAAGG 1380 

GAGAACAGGG GTCGCCAACA CTTGTGTTGC ITTCTGACTG ATCCTGAACA AGAAAGAGTA 1440 

ACACTGAGGC GCTCGCTCCC ATGCACAACT CTCCAAAACA CTTATCCTCC TGCAAGAGTG 1500 

GGCTTTCCAG GGTCTTTACT GGGAAGCAGT TAAGCCCCCT CCTCACCCCT TCCTTTTTTC 1560

TTTCTTTACT CCTTTGGCTT CAAAGGATTT TGGAAAAGAA ACAATATGCT TTACACTCAT 1620 

TTTCAATTTC TAAATTTGCA GGGGATACTG AAAAATACGG CAGGTGGCCT AAQGCTGCTG 1680

TAAAGTTGAG GGGAGAGGAA ATCTTAAGAT TACAAGATAA AAAACGAATC CCCTAAACAA 1740

AAAGAACAAT AGAACTGGTC TTCCATTTTG CCACCTTTCC TGTTCATGAC AOCTACTAAC 1800 

CTGGAGACAG TAACATTTCA TTAACCAAAG AAAGTGGGTC ACCTGACCTC TGAAGAGCTG 1860

AGTACTCAGG CCACTCCAAT CACCCTACAA GATGCCAAGG AGGTCCCAGG AAGTCCAGCT 1920 

CCTTAAACTG ACGCTAGNMA ATAAACCTGG GCAAGTGAGG CAAGAGAAAT GAGGAAGAAT 1980 

CCATCTGTGA GGTGAYAGGC AAGGATGAAA GACAAAGAAG GAAAAGAGTA TCAAAGGCAG 2040 

AAftGGAGATC ATTTAOrrGG GTCTGAAAGG AAAAGTCTTT GCTATCCX3AC ATGTACTGCT 2100 

AGTACCTGTA AGCATTTTAG GTCCCAGAAT GGAAAAAAAA ATCAGCTATT GGTAATATAA 2160

TAATGTCCTT TCCCTGGAGT CAGTTTTTTT AAAAAGTTAA CTCTTAGTTT TTACTTGTTT 2220 

AATTCTAAAA GAGAAGGGAG CTCAGGCCAT TCCCTGTAGG AGTAAAGATA AAAGGATAGG 2280 

AAAAGATTCA AAGCTCTAAT AGAGTCACAG CTTTCCCAGG TATAAAACCT AAAATTAAGA 2340 

AGTACAATAA GCAGAGGTCG AAAATGATCT AGTTCCTGAT AGCTACCCAC AGAGCAAGTG 2400 

A1TTATAAAT TTGAAATCCA AACTACTTTC TTAATATCAC TTTGGTCTCC ATTTTTCCCA 2460

GGACAGGAAA TATGTCCCCC CCTAACTTTC TTGCTTCAAA AATTAAAATC CAGCATCCCA 2520 

AGATCATTCT ACAAGTAATT TTGCACAGAC ATCTCCTCAC CCCAGTGCCT GTCTGGAGCT 2580 

CACCCAAGGT CANCCAAACA ACTTGGTTGT GAACCCAACT GCCTTAACCT TCTGGGGGAG 2640 

GGGGATTAGC TAGACTAGGA GACCXIAGAAG TGAATGGGAA AGGOTGAGGA CTTCACAATG 2700 

TTGGCCTGTC AGAGCTTGAT TAGAAGCCAA GACAGTQGCA GCAAAGGAAG ACTTGGCCX:A 2760

GGAAAAACCT GTGGGTTGTG CTAATTTCTG TCCAGAAAAT AGGGTGGACA GAAGCTTGTG 2820 

GGGTGCATGG AGGAATTGGG ACCTGGTTAT GTTGTTATTC TCGGACTGTG AATTTTGGTG 2880 

ATGTAAAACA GAATATTCTG TAAACCTAAT GTCTGTATAA ATAATGAGCG TTAACACAGT 2940 

AAAATATTCA ATAAGAAGTC AAAAAAAAAA AAAAAAAACT CGAQGGGGGG CCCGGTACXX: 3000 

AATTTOCXrAA ATAGAGATNG TATTAC 3026



(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 
GCAGGCTTTG TGCTCACCTA CAAGCTGGGT GAGCAGGGTG CCAGCAGCCT GTTTCCTCTT 60

CTCCTGCTGG ACCACGGCGT TTCTGCTCCC GAGTTGGGAC TGTGGAATGG TGTGGGTGCT 120

GTCGTCTGCT CCATCGCTGG CTCCTCCCTG GGTGGGACCT TGCTGGCX:AA GCACTGGAAA 180 

CTGCTGCCTC TGTTGARGTC GGTGCTGCGC TTCCGCCTTCG GGGGCCTAGC CTGTCAGACT 240 

GCCTTGGTCT TCCACCTGGA CACCCTGGGG GCCAGCATGG ACGCTGGCAC AATCTTGAGA 300 

GGGTCAGCCT TGCTGAGCCT ATGTCTGCAG CACTTCTTGG GAGGCCTGGT CACCACAGTC 360

ACCTTCACTG GGATGATGCG CTGCAGCCAG CTGGCCCCCA GGQCCTGCAG GCCACACACT 420 

ACAGCCTTCT GGCCACGCTG GAGCTGCTGG GGAAGCTGCT GCTGGGCACT CTGCGGAGGC 480 

CTGGCTGATG GGTTGGGGCC ACATCCCTGC TTCTTGCTCC TGCTCATCCT CTCTGCCTTT 540 

CCCGTTCTGT ACXrTGGACCT AGCACCCAGC ACCTTTCTCT GAGCTGAGTG GCTGGAGTGG 600 

TCAATAAAGC CACATGTGCC TGTGGCCCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 660

AACTGGAGGG GGGGCCCGGT ACCCAAATCG CCGGATATGA TCGTAAACAA TC 712 





(2) INFORMATION FOR SEQ ID NO: 312:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1289 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

CAAAATTTCA GAACTTTCAG GAGGGCAWSA GAATATCAAA CAAAGATTTC TGGAAGTATT 60

TTGCCAACCT TCTGGTTGAG CTGCAAGAAA ATATTTATGG TGAGAACTTT TCTGTTTCCC 120 

GTTATTGGGT TTTTGGTTGG TTTTTGTTTG TTTTTTACTA TGCTTTGGTC TGTAAAAATA 180 

TGCAACTGAA CTACATTCAG AAGGAAATAT TGTCTACATA GAATATTATA TGAAGTTGGT 240 

ACATAATTCT GATGAGGAAA AAAAATCTTT GCAATTCTTT AAGCCATATT GTTGTTTTTC 300

TGTGTTGTTT TCCCTGGATG AAAATATCAG TATTAAGTAG ACAGCATATT ATTCAAGTGT 360 

TTAGACTTAT TAATATGTTC TTGTCCTGTA TTTATACATA TGTGTATTTT GGAAAGTATT 420 

GCCTTTTTTA AGGGAAGCTA TAATTCGATA CATAGTGAAA AAGGGAATGG TGACCCCTTT 480 

GTGCCTCTTC CACTGAGGAT AACAAACAGC ATTGTAATCC ATTCTCTTGC ACCTTCTTCT 540 

TCTTATCTTG TTATTACGGT TTTATTAATT TTGTAGAGGG ACAGGGAGTG GGCAAGGGGA 600

AGAAGCAGCT TATTTGACTA ACCAGCCCCT CTGTGGTCCA CCAGCGTCTT GGCTTGGTGG 660 

GAGGGCTCTC AATCAGCAGG GCCCCAGGAG GGAAGAAGAA GTGGGGCAAA GCCTGGCCTC 720 

GCCGCTCGGG AGCTTTGCCA TCTGAGCCAC GCCTCCTCCA GGCCATGCTC CTTGAACTTC 780 

GAAATGTCAA CCGGAGCCCT TACACCAGCC CTCCAGCATC TAATAGACTT GAATCTACTC 840 

TAAACGAATA TTTAATCCAA CCTCACTACA TTGTAGCTCA GTCCAACGAC TAACCCTGAA 900 

ATGGGGGTGT TCCAGCCTTC AGCGAGATGG CCAAGCGGTC CCCTGGGGGC TGTGGCAGCG 960 

GGCTTATCCT TCTCTGTTGC CAACCTTGCC GTCCGACXTTC CTCCGCCCCC ATGCGGTGAC 1020 

CCCGTCCGTG TCTGTGTCTG TCCATACGTG TGAGTCCAGC TAAAAAGACA AAACAGAACC 1080 

CGTGGGCXrCA GCTCGGAAGG TGCGTGGAGA AGGCTCCGAC GTCTCCGAAG TGCAGCCCTT 1140 

GGGATGGCAT TCCGTTGTGT GCCTTATTCC TGGAGAATCT GTATACX3GCT CGCCTATAGA 1200 

AATATAGCCT CTTCATGCTG TATTAAAAGG ACTTTTAAAA GCAAAAAAAA AAAAAAAAAA 1260 

CTTGAGGGGG GGNCCGGTAC CCAATTNTC 1289 



(2) INFORMATION FOR SEQ ID NO: 313:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

Met Phe Leu Ile Phe Val Tyr Phe Leu Lys Ile Leu Phe Ser Ser Ser
1 5 10 15

Leu Pro Phe Leu Trp Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 



Met Met Phe Leu Thr Gln Gly Gly Pro Leu Pro Ser Thr Arg Ala Arg
1 5 10 15

Pro Thr Cys Gln Ala Gly Ala Leu Pro Lys Pro Ser Gly Leu Leu Gly
20 25 30

Val Thr Cys Trp Asn Gly Leu Lys Gly Pro Leu Cys Gly Asn Arg Cys
35 40 45

Ser Pro Asn Thr Leu Leu Leu Ala Ala Arg Gln Ala Leu Trp Lys Gly
50 55 60

Arg Gly Arg Thr His Gln Asp Leu Pro Gly Pro Leu Gln Gly Arg Gln
65 70 75 80

Leu Gly Pro Glu Pro Lys His Leu Ala Leu Leu Pro Pro Arg Gly Gln
85 90 95

Glu Ala Ser Trp Ala Ser Ser Leu Pro Gly Gln Gly Pro Leu Pro Leu
100 105 110

Pro His Ile Asn Cys Thr Val Phe Ser Leu Lys Ala Ser Phe Ile Lys
115 120 125



(2) INFORMATION FOR SEQ ID NO: 315:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

Met Gln Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu
1 5 10 15

Cys Asp Val Pro Ile Ser Leu Gly Phe Ser Pro Ser
20 25 



(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

Met Asp Gly Phe Ser Ser Arg Leu Phe Ser Ser Leu Pro Phe Val Ala
1 5 10 15

Leu Gln Trp Phe Ile Val Ile Ser His Leu Leu Ser Leu Ser Leu Ser
20 25 30

Ala Cys Cys Tyr Gln Thr His Cys Ser Leu Xaa Gln Leu Ser Ser Ala
35 40 45

Phe Ser Xaa Met Gly Glu Ser Cys Val Gly Glu Arg Glu Tyr Xaa Phe
50 55 60



(2) INFORMATION FOR SEQ ID NO: 317: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Met Pro Leu Ile Asn Leu Leu Leu Leu Tyr Tyr Val Pro Asn Gly Gly
1 5 10 15

Lys Gln Asp Lys Lys
20



(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Gly Arg His Leu Val Leu Val Met Phe Ile Thr Thr Ser Leu His
1 5 10 15

Ser Gly Thr Pro Val Pro Glu Asn Val Ile Cys Gly Val Thr Lys Gly
20 25 30

Pro Gln Gly Lys Lys Lys Lys
35 

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 



Met Leu Trp Trp Ser Arg Asp Tyr Thr Met Val Phe Leu Leu Phe Thr
1 5 10 15



Met Val Phe Thr Gly Asp Leu Val Ile Arg Gly Arg Thr Glu Leu Ser
20 25 30

Leu 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Val Cys Ser Ser Leu Cys Asp Ile Gly Gly Ile Ile Thr Pro Phe
1 5 10 15

Ile Val Phe Arg Leu Arg Glu Val Trp Gln Ala Leu Pro Leu Ile Leu
20 25 30

Phe Ala Val Leu Gly Leu Leu Ala Ala Gly Val Thr Leu Leu Leu Pro
35 40 45

Glu Thr Lys Gly Val Ala Leu Pro Glu Thr Met Lys Asp Ala Glu Asn
50 55 60

Leu Gly Arg Lys Ala Lys Pro Lys Glu Asn Thr Ile Tyr Leu Lys Val
65 70 75 80

Gln Thr Ser Glu Pro Ser Gly Thr
85



(2) INFORMATION FOR SEQ ID NO: 321:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Gln Pro Gly Ala Gly Val Leu Val Leu Gly Leu Leu Leu Pro Pro
1 5 10 15

Pro Gln Ser Pro Ser Leu Ser
20 



(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

Met Thr Phe Thr Leu Gly Asp Ser Gln Val Leu Leu Ile Asn Leu Phe
1 5 10 15

Pro Ser Met Pro Ser Gly Ser Cys Ala Arg Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 



Met Cys Leu Glu Cys Trp Ala Glu Asn Leu Gly Pro His His Thr Ser
1 5 10 15

Ser Leu Leu Asn Pro Arg His Leu Pro Ser Ile Pro Ala Met Phe Pro
20 25 30

Val Ser Ser Gly Cys Phe Gln Glu Gln Gln Glu Met Asn Lys Ser Leu
35 40 45

Val Ser Cys Leu Phe Val Leu His Phe Val Leu His Cys Ile Phe Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 324: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 196 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:

Met Leu Ser Thr Ser Glu Tyr Ser Gln Ser Pro Lys Met Glu Ser Leu
1 5 10 15

Ser Ser His Arg Ile Asp Glu Asp Gly Glu Asn Thr Gln Ile Glu Asp
20 25 30

Thr Glu Pro Met Ser Pro Val Leu Asn Ser Lys Phe Val Pro Ala Glu
35 40 45

Asn Asp Ser Ile Leu Met Asn Pro Ala Gln Asp Gly Glu Val Gln Leu
50 55 60

Ser Gln Asn Asp Asp Lys Thr Lys Gly Asp Asp Thr Asp Thr Arg Asp
65 70 75 80

Asp Ile Ser Ile Leu Ala Thr Gly Cys Lys Gly Arg Glu Glu Thr Val
85 90 95

Ala Glu Glu Val Cys Ile Asp Leu Thr Cys Asp Ser Gly Ser Gln Ala
100 105 110

Val Pro Ser Pro Ala Thr Arg Ser Glu Ala Leu Ser Ser Val Leu Asp
115 120 125

Gln Glu Glu Ala Met Glu Ile Lys Glu His His Pro Glu Glu Gly Ser
130 135 140

Ser Gly Ser Glu Val Glu Glu Ile Pro Glu Thr Pro Cys Glu Ser Gln
145 150 155 160

Gly Glu Glu Leu Lys Glu Glu Asn Met Glu Ser Val Pro Leu His Leu
165 170 175

Ser Leu Thr Glu Thr Gln Ser Gln Gly Leu Cys Leu Arg Arg His Pro
180 185 190

Lys Lys Lys Lys
195 



(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg Lys 
1 5 10 15

Arg Leu Leu Glu Gln Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu
20 25 30

Ala Xaa Xaa Gly Ile Gly Leu Met Val Leu His Ala Glu Met Leu Trp
35 40 45

Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr
50 55 60

Leu Trp Leu Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr Gly Asp Val
65 70 75 80

Val Pro Gly Thr Met Trp Gly Lys Ile Val Cys Leu Cys Thr Gly Val
85 90 95

Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys
100 105 110

Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp
115 120 125

Ile Gln Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gln
130 135 140

Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala
145 150 155 160

Ala Arg Xaa His Gln Arg Xaa Leu Leu Ala Ala Ile Asn Ala Phe Arg
165 170 175

Gln Val Arg Leu Lys His Arg Lys Leu Arg Glu Gln Val Asn Ser Met
180 185 190

Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln Asn
195 200 205

Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu Ala
210 215 220

Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro
225 230 235 240

Arg Gln Leu Pro Glu Pro Ser Gln Gln Ser Lys Xaa
245 250

(2) INFORMATION FOR SEQ ID NO: 326:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val
1 5 10 15

Ile Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala
20 25 30

Val Ile Lys Asn Asn Ser His Tyr Gln Thr Ser Lys Ala Leu Glu Leu
35 40 45

Glu Lys Thr Thr Glu Asn Lys Glu Ser Asn Pro Phe Ile Leu Gln Val
50 55 60

Asn Lys Leu Xaa
65



(2) INFORMATION FOR SEQ ID NO: 327:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

Met Gly Glu Gly Lys Asn Gly Phe Gly Gly Phe Val His Thr Ala Asp
1 5 10 15

Ala Cys Trp Glu Gly Val His Ser Glu Pro Val Cys Arg Thr Val His
20 25 30

Thr Val His Thr Cys His His Gln Ala Phe Leu Val Leu Ile Gly Trp
35 40 45

Ser Lys Ser Gly Lys Glu Arg Lys Glu Ala Phe Leu Thr Ala Ile Ile
50 55 60

Leu Asn Ser Arg Ser Ile His Ile Ser Cys Ser Trp Pro Pro Ser Pro
65 70 75 80

Val Pro Gln Xaa



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:

Met Leu Leu Ile Asn Leu Leu Trp Leu Val Thr Met Ile Lys Ser Val
1 5 10 15

Ile Asn Asn Asn Ile Ile Leu Phe Leu Lys Lys Lys Ser Leu Phe Phe
20 25 30

Ile Asp Ser Val
35 



(2) INFORMATION FOR SEQ ID NO: 329: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

Met Thr Phe Pro Phe Glu Lys Lys Ile Val Ala Phe Ser Ala Phe Tyr
1 5 10 15

Leu Ile Pro Gly Glu Ser Arg Leu Ala Pro Thr Phe Asn Pro Ser Ala
20 25 30

Asp Met Thr Val Ile Leu Arg Gly Arg Ala Gln His Lys Thr Ala Met
35 40 45

Leu Glu Ser Tyr Asn Trp Lys Val Ser Cys Gln Leu Arg Glu Xaa
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 



Met His Ser Lys Gly Ser Ser Leu Leu Leu Phe Leu Pro Gln Leu Ile
1 5 10 15

Leu Ile Leu Pro Val Cys Ala His Leu His Glu Glu Leu Asn Cys Cys
20 25 30



Phe His Arg 
35 



(2) INFORMATION FOR SEQ ID NO: 331: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Met Gly Ala Leu Val Leu Leu Leu Cys Leu Leu Val Gly Val Gln Gln
1 5 10 15

Ser Gly Ser Val Trp Asp Ser 
20 



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



Met Gln Ser Ala Glu Ile Leu Ser Trp Thr Asp Val Leu His Asp Phe
1 5 10 15

Leu Phe Ser Leu Phe Leu Trp Pro Ala Phe Glu Asp Arg Ala Leu Leu
20 25 30

Ile Phe Thr Leu Asn Gln Ile Val
35 40 



(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

Met Gln Ser Leu Val Gln Trp Gly Leu Asp Ser Tyr Asp Tyr Leu Gln
1 5 10 15

Asn Ala Pro Pro Gly Phe Phe Pro Arg Leu Gly Val Ile Gly Phe Ala
20 25 30

Gly Leu Ile Gly Leu Leu Leu Ala Arg Gly Ser Lys Ile Lys Lys Leu
35 40 45

Val Tyr Pro Pro Gly Phe Met Gly Leu Ala Ala Ser Leu Tyr Tyr Pro
50 55 60

Gln Gln Ala Ile Val Phe Ala Gln Val Ser Gly Glu Arg Leu Tyr Asp
65 70 75 80

Trp Gly Leu Arg Gly Tyr Ile Val Ile Glu Asp Leu Trp Lys Glu Asn
85 90 95

Phe Gln Lys Pro Gly Asn Val Lys Asn Ser Pro Gly Thr Lys Xaa
100 105 110



(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Met Ala Pro Ser Leu Leu Leu Leu Ala Pro Leu Cys Ser Leu Glu Ala 
1 5 10 15 



Val Leu Ser Ser Pro Leu Glu Lys Gln Cys Gln Leu Pro Gly Ile Phe
20 25 30

Cys Gln Leu Gln Leu Pro Cys Pro Leu Leu Leu Ser Ala Gln Leu Leu
35 40 45

Lys Gly Ile Val Xaa Pro Arg Cys Pro Ala Ser Leu Pro Gln Pro Pro
50 55 60

His Pro Ala Pro Ser Trp His Leu Pro Leu His Cys Thr Glu Arg Xaa
65 70 75 80

Pro His His Leu Pro Leu Gln Gly Gly Ser Ser Asn Met Glu Glu Xaa
85 90 95

Asn Tyr Arg Gly Tyr Xaa Asp Ala Gln Leu
100 105





(2) INFORMATION FOR SEQ ID NO: 335: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:

Met Thr Thr Cys Leu Phe Gly Leu Leu Ser Cys Glu Met Ser Ala Gln
1 5 10 15

Val Ser Gln Lys Ser Cys Val Tyr Asp Glu Ser Glu Cys Phe Ser Ser
20 25 30

Val Gly Gln Leu Leu Ala Leu Leu Ile Leu Val Tyr Val Leu Pro Ser
35 40 45

Ile Xaa
50 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

Met Leu Trp Lys Cys Ser Gln Asn Ile Ala Arg Cys Leu Leu Leu Leu
1 5 10 15

Leu Ala Leu Val Glu Ile Lys Leu Glu Asp Leu Gln Ser Gln Leu His
20 25 30

Pro Thr Trp Lys Ser Ile Pro Gly Pro Ser Pro Arg Asn Gln His Arg
35 40 45 





(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 



Met Leu Ile Pro Leu Gln Cys Leu Phe Ser Ser Asp Arg Met Leu Thr
1 5 10 15

Phe Leu Thr Pro Trp Gln Lys Gly Glu Lys Cys Val Leu Gly Trp Val
20 25 30

Thr Lys Phe Leu Ser Glu Ile Ser Xaa
35 40 



(2) INFORMATION FOR SEQ ID NO: 338:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 76 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met Thr Phe Ser Ser Leu Lys Leu Phe Val Leu Thr Cys Ile Ile Lys
1 5 10 15

Gly Leu Glu Arg Phe Ile Ile Leu Arg Glu Val Cys Asn Gln Glu Ile
20 25 30

Gln Arg Ser Leu Ser Ser Asn Leu Val His Val Leu Leu Gln Pro Ala
35 40 45

Thr Phe Lys Asp Val Leu Val Thr Glu Ile Ile Cys Leu Cys Met Cys
50 55 60

Leu Tyr Ser Ile Lys Tyr Met Pro Pro Gln Lys Lys
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Lys Val Tyr Ile Phe Leu Ile Phe Met Val Leu Ile Leu Pro Ser Leu
1 5 10 15

Gly Leu Thr Arg Tyr Met Pro Pro Xaa Ser Xaa Leu Asn Ser Glu
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



Met Ala Lys Ile Ser Pro Phe Glu Val Val Lys Arg Thr Ser Val Pro
1 5 10 15

Val Leu Val Gly Leu Val Ile Val Ile Val Ala Thr Glu Leu Met Val
20 25 30



Pro Gly Thr Ala Ala Ala Val Thr Gly Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Met Arg Leu Phe Phe Ile Gly Phe Leu Leu Leu Phe Ser Phe Gly Leu
1 5 10 15

Leu Arg Gln Pro Ser Leu Ser Ala Glu His
20 25 



(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Val Phe Ser Val Ser Ser Ala Leu Ala Leu Leu Leu Met Leu Leu
1 5 10 15

Arg Ser Ser Asp Leu Ala Lys Lys Thr Glu
20 25





(2) INFORMATION FOR SEQ ID NO: 343: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

Met Ser Leu Glu Phe Tyr Gln Lys Lys Lys Ser Arg Trp Pro Phe Ser
1 5 10 15

Asp Glu Cys Ile Pro Trp Glu Val Trp Thr Val Lys Val His Val Val
20 25 30

Ala Leu Ala Thr Glu Gln Glu Arg Gln Ile Cys Arg Glu Lys Val Gly
35 40 45

Glu Lys Leu Cys Glu Lys Ile Ile Asn Ile Val Glu Val Met Asn Arg
50 55 60

His Glu Tyr Leu Pro Lys Met Pro Thr Gln Ser Glu Val Asp Asn Val
65 70 75 80

Phe Asp Thr Gly Leu Arg Asp Val Gln Pro Tyr Leu Tyr Lys Ile Ser
85 90 95

Phe Gln Ile Thr Asp Ala Leu Gly Thr Ser Val Thr Thr Thr Met Arg
100 105 110

Arg Leu Ile Lys Asp Thr Leu Pro Ser Glu Arg Arg Trp Ile Ser Gly
115 120 125

Ser Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly Ile Ala Leu Leu
130 135 140

Gly Leu Trp Ala Leu Glu Pro Ala Leu Gly His Trp Xaa
145 150 155



50 



(2) INFORMATION FOR SEQ ID NO: 344:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 520 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344:

Met Phe Leu Leu Pro Leu Pro Ala Ala Gly Arg Val Val Val Arg Arg
1 5 10 15

Leu Ala Val Arg Arg Phe Gly Ser Arg Ser Leu Ser Thr Ala Asp Met
20 25 30

Thr Lys Gly Leu Val Leu Gly Ile Tyr Ser Lys Glu Lys Glu Asp Asp
35 40 45

Val Pro Gln Phe Thr Ser Ala Gly Glu Asn Phe Asp Lys Leu Leu Ala
50 55 60

Gly Lys Leu Arg Glu Thr Leu Asn Ile Ser Gly Pro Pro Leu Lys Ala
65 70 75 80

Gly Lys Thr Arg Thr Phe Tyr Gly Leu His Gln Asp Phe Pro Ser Val
85 90 95

Val Leu Val Gly Leu Gly Lys Lys Ala Ala Gly Ile Asp Glu Gln Glu
100 105 110

Asn Trp His Glu Gly Lys Glu Asn Ile Arg Ala Ala Val Ala Ala Gly
115 120 125

Cys Arg Gln Ile Gln Asp Leu Glu Leu Ser Ser Val Glu Val Asp Pro
130 135 140

Cys Gly Asp Ala Gln Ala Ala Ala Glu Gly Ala Val Leu Gly Leu Tyr
145 150 155 160

Glu Tyr Asp Asp Leu Lys Gln Lys Lys Lys Met Ala Val Ser Ala Lys
165 170 175

Leu Tyr Gly Ser Gly Asp Gln Glu Ala Trp Gln Lys Gly Val Leu Phe
180 185 190

Ala Ser Gly Gln Asn Leu Ala Arg Gln Leu Met Glu Thr Pro Ala Asn
195 200 205

Glu Met Thr Pro Thr Arg Phe Ala Glu Ile Ile Glu Lys Asn Leu Lys
210 215 220

Ser Ala Ser Ser Lys Thr Glu Val His Ile Arg Pro Lys Ser Trp Ile
225 230 235 240

Glu Glu Gln Ala Met Gly Ser Phe Leu Ser Val Ala Lys Gly Ser Asp
245 250 255

Glu Pro Pro Val Phe Leu Glu Ile His Tyr Lys Gly Ser Pro Asn Ala
260 265 270

Asn Glu Pro Pro Leu Val Phe Val Gly Lys Gly Ile Thr Phe Asp Ser
275 280 285

Gly Gly Ile Ser Ile Lys Ala Ser Ala Asn Met Asp Leu Met Arg Ala
290 295 300

Asp Met Gly Gly Ala Ala Thr Ile Cys Ser Ala Ile Val Ser Ala Ala
305 310 315 320

Lys Leu Asn Leu Pro Ile Asn Ile Ile Gly Leu Ala Pro Leu Cys Glu
325 330 335

Asn Met Pro Ser Gly Lys Ala Asn Lys Pro Gly Asp Val Val Arg Ala
340 345 350

Lys Asn Gly Lys Thr Ile Gln Val Asp Asn Thr Asp Ala Glu Gly Arg
355 360 365

Leu Ile Leu Ala Asp Ala Leu Cys Tyr Ala His Thr Phe Asn Pro Lys
370 375 380

Xaa Ile Leu Asn Ala Ala Thr Leu Thr Gly Ala Met Asp Val Ala Leu
385 390 395 400

Gly Ser Gly Ala Thr Gly Val Phe Thr Asn Ser Ser Trp Leu Trp Asn
405 410 415

Lys Leu Phe Glu Ala Ser Ile Glu Thr Gly Asp Arg Val Trp Arg Met
420 425 430

Pro Leu Phe Glu His Tyr Thr Arg Gln Val Val Asp Cys Gln Leu Ala
435 440 445

Asp Val Asn Asn Ile Gly Lys Tyr Arg Ser Ala Gly Ala Cys Thr Ala
450 455 460

Ala Ala Phe Leu Lys Glu Phe Val Thr His Pro Lys Trp Ala His Leu
465 470 475 480

Asp Ile Ala Gly Val Met Thr Asn Lys Asp Glu Val Pro Tyr Leu Arg
485 490 495

Lys Gly Met Thr Gly Arg Pro Thr Arg Thr Leu Ile Glu Phe Leu Leu
500 505 510

Arg Phe Ser Gln Asp Asn Ala Xaa
515 520



(2) INFORMATION FOR SEQ ID NO: 345:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:

Thr Ile Leu Phe Leu Phe Leu Gln Leu Ser Ala Leu Arg Leu Ile Val
1 5 10 15

Gly Lys Asp Ser Ile Asp Ile Asp Ile Ser Ser Arg Arg Arg Glu Asp
20 25 30

Gln Ser Leu Arg Leu Asn Ala
35



(2) INFORMATION FOR SEQ ID NO: 346:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 234 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346:

Met Thr Ser Glu Leu Asp Ile Phe Val Gly Asn Thr Thr Leu Ile Asp
1 5 10 15

Glu Asp Val Tyr Arg Leu Trp Leu Asp Gly Tyr Ser Val Thr Asp Ala
20 25 30

Val Ala Leu Arg Val Arg Ser Gly Ile Leu Glu Gln Thr Gly Ala Thr
35 40 45

Ala Ala Val Leu Gln Ser Asp Thr Met Asp His Tyr Arg Thr Phe His
50 55 60

Met Leu Glu Arg Leu Leu His Ala Pro Pro Lys Leu Leu His Gln Leu
65 70 75 80

Ile Phe Gln Ile Pro Pro Ser Arg Gln Ala Leu Leu Ile Glu Arg Tyr
85 90 95

Tyr Ala Phe Asp Glu Ala Phe Val Arg Glu Val Leu Gly Lys Lys Leu
100 105 110

Ser Lys Gly Thr Lys Lys Asp Leu Asp Asp Ile Ser Thr Lys Thr Gly
115 120 125

Ile Thr Leu Lys Ser Cys Arg Arg Gln Phe Asp Asn Phe Lys Arg Val
130 135 140

Phe Lys Val Val Glu Glu Met Arg Gly Ser Leu Val Asp Asn Ile Gln
145 150 155 160

Gln His Phe Leu Leu Ser Asp Arg Leu Ala Arg Asp Tyr Ala Ala Ile
165 170 175

Val Phe Phe Ala Asn Asn Arg Phe Glu Thr Gly Lys Lys Lys Leu Gln
180 185 190

Tyr Leu Ser Phe Gly Asp Phe Ala Phe Cys Ala Glu Leu Met Ile Gln
195 200 205

Asn Trp Thr Leu Gly Pro Val Asp Ser Gln Met Asp Asp Met Asp Met
210 215 220

Asp Leu Asp Arg Asn Phe Ser Arg Thr Xaa
225 230



(2) INFORMATION FOR SEQ ID NO: 347:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 169 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347:

Met Ala Ala Ala Val Ala Gly Met Leu Arg Gly Gly Leu Leu Pro Gln
1 5 10 15

Ala Gly Arg Leu Pro Thr Leu Gln Thr Val Arg Tyr Gly Ser Lys Ala
20 25 30

Val Thr Arg His Arg Arg Val Met His Phe Gln Arg Gln Lys Leu Met
35 40 45

Ala Val Thr Glu Tyr Ile Pro Pro Lys Pro Ala Ile His Pro Ser Cys
50 55 60

Leu Pro Ser Pro Pro Ser Pro Pro Gln Glu Glu Ile Gly Leu Ile Arg
65 70 75 80

Leu Leu Arg Arg Glu Ile Ala Ala Val Phe Gln Asp Asn Arg Met Ile
85 90 95

Ala Val Cys Gln Asn Val Ala Leu Ser Ala Glu Asp Lys Leu Leu Ile
100 105 110

Ala Thr Pro Ala Ala Glu Thr Gln Asp Pro Asp Glu Gly Leu Pro Gln
115 120 125

Pro Gly Pro Glu Ser Pro Ser Trp Arg Ile Pro Ser Thr Lys Ile Cys
130 135 140

Cys Pro Phe Leu Trp Gly Thr Thr Cys Cys Trp Ser Val Lys Ser Pro
145 150 155 160

Arg Ser Arg Arg Trp Tyr Gly Ser Xaa
165



(2) INFORMATION FOR SEQ ID NO: 348:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348:

Met Lys Arg Ser Phe Leu Leu Pro Leu Leu Leu Val Gly Phe Leu Asp
1 5 10 15

Thr Ala His Leu Ile Leu Leu Glu Thr Leu Ser Val Cys Leu Trp Leu
20 25 30

Pro Ser Leu Ile Asp Ser Arg Cys Val Met Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 349:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 78 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

Met Lys Glu Gly Pro Pro Cys Lys Arg His His Tyr Tyr Gln Asn Cys
1 5 10 15

Gly Ala Lys Leu Leu Val Ser Leu Phe Gly Glu Thr Asn Gln Ile His
20 25 30

Leu Leu Glu Thr Gln Val Gly Thr Glu Lys Gly Gly Glu Arg Ile Trp
35 40 45

Glu Glu Lys Trp Arg Ile Ser Ser Thr Val Leu Phe Ile Ser Val Asn
50 55 60

Ser Tyr Val Glu Gly Ser Val Leu Glu Ile Lys Leu Phe Tyr
65 70 75



(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

Met Ser Glu Ile Leu Ser Leu Leu Phe Cys Leu Leu Gly Pro Ala Leu
1 5 10 15

Asp Glu Arg Arg Glu Glu Lys Asp
20



(2) INFORMATION FOR SEQ ID NO: 351:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 274 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351:

Met Ser Ser Ala Gly Thr Ala Thr Pro Leu Glu Met Asp His Lys Leu
1 5 10 15

Thr Ser Gln Pro Gly Arg Pro Ser Phe Tyr Cys Asn Ser Arg His Ser
20 25 30

Ile Val Gly Ser Ser His Gln Leu Gly Phe Trp Phe Ser His Leu Glu
35 40 45

Ser Ser Gly Leu Lys Val Phe Gln Val Ser Leu Pro Cys Glu Cys Val
50 55 60

Asn Leu Pro Thr Arg Ile Ala Ser Val Val Leu Ser Leu Met Ser Leu
65 70 75 80

Leu Val Val Gly Gln Ala Pro Ala Trp Glu Gly Ser Leu Leu Arg Gly
85 90 95

Arg Pro Ala Gly Gly Ala His Leu Cys Ala Met Xaa Val Ile Glu Gly
100 105 110

Leu Val Val Asp Val Gly Glu Arg Ile Leu His Gly Gln Arg Glu Val
115 120 125

Gly Gln Val Ser Gln Val Leu Pro Ala Leu Ser Leu Gly Leu Val Phe
130 135 140

Leu Cys Gln Gly Thr Val Glu Lys Val Ser Gly Ala Ala His Cys Ser
145 150 155 160

Ser Leu Leu Cys Cys Leu Pro Trp Gln Cys Ser Gly Gly Gly Phe Pro
165 170 175

Thr Xaa Arg Cys Ser Arg Pro Tyr Phe Ser Ser His Lys Gly Val Ala
180 185 190

Ala Thr Leu Ala Leu Thr Cys His Cys Asp Lys Val His Val Ala Gly
195 200 205

Leu Gly Lys Asp Trp Ala Ile Glu Gln Arg Arg Arg Thr Cys Glu Ser
210 215 220

Asp Xaa Glu Xaa Xaa Pro Phe Thr Leu Ala Gly Leu Val Leu Val Leu
225 230 235 240

Arg Phe Cys Gln Val Val Leu Val Trp Ile Pro Gln Leu Gly Asp Lys
245 250 255

His Trp Arg Gly Met Thr Arg Leu Gly Arg Val Ser Leu Thr Ser Ser
260 265 270

Ile Xaa



(2) INFORMATION FOR SEQ ID NO: 352:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:

Met Ile Phe Thr Ser Val Thr Lys Gly Ile Leu Leu Ile Ala Leu Trp
1 5 10 15

Val Pro Leu Phe His Phe Met Leu Ile Asp Ser Ile Leu Gly Pro Ser
20 25 30

Arg Leu Leu Thr Asp Gly Val Pro Phe Asn Pro Trp His Val Xaa
35 40 45



SDOCID: <WO_9839448A2_lA>

(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353:

Met Lys Thr
1

(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

Met Ser Ile Ser Gly Thr Asp Gly Leu Ile Leu Leu Leu Val Gly Leu
1 5 10 15

Glu Ala Xaa Val Arg Ser Ser Lys Lys Trp Ile Pro Lys Ala Leu Xaa
20 25 30

Val Thr Gln Ala Lys Trp Asn Ser Trp Pro Ser Arg Arg Asn Ala Gly
35 40 45

Phe Ala Leu His
50



(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 132 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

Met Glu His Cys Leu Tyr His Ser Val His Gly Ile Asn Pro Tyr Ile
1 5 10 15

His Lys Asn Thr His Pro Ser Ile Asn Ile Tyr Met Val Trp Asp Glu
20 25 30

Gln Val Asn Ser Phe Glu Arg Glu Phe Val Pro Phe Phe Phe Leu Ile
35 40 45

Ile Leu Leu Asn Cys Cys Gln Leu Ser Asn Lys Gln Thr Glu Lys Leu
50 55 60

Phe Gly Lys Thr Leu His Thr Pro Phe Leu Ser Ser Ala Leu Lys Tyr
65 70 75 80

Arg Leu Asn Thr His Ile Leu Pro Val Phe Ser Tyr Ser Asp Ser Ile
85 90 95

Leu Thr Cys His Leu Ile Leu Ala Ser Tyr Phe Ser His Val Tyr Leu
100 105 110

Pro Val Thr Cys Ile Cys Tyr Leu Asn Arg Lys Lys Asn Ile Gln Lys
115 120 125

Lys Lys Asn Xaa
130



(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

Met Gly Ser Arg Asp His Leu Phe Lys Val Leu Val Val Gly Asp Ala
1 5 10 15

Ala Val Gly Lys Thr Ser Leu Val Gln Asp Tyr Ser Gln Asp Ser Phe
20 25 30

Ser Lys His Tyr Lys Ser Thr Val Gly Val Asp Phe Ala Leu Lys Val
35 40 45

Leu Gln Trp Ser Asp Tyr Glu Ile Val Arg Leu Gln Leu Trp Asp Ile
50 55 60

Ala Gly Gln Glu Arg Phe Thr Ser Met Thr Arg Leu Tyr Tyr Arg Asp
65 70 75 80

Ala Ser Ala Cys Val Ile Met Phe Asp Val Thr Asn Ala Thr Thr Phe
85 90 95

Ser Asn Ser Gln Arg Trp Lys Gln Asp Leu Asp Ser Lys Leu Thr Leu
100 105 110

Pro Asn Gly Glu Pro Val Pro Cys Leu Leu Leu Ala Asn Lys Cys Asp
115 120 125

Leu Ser Pro Trp Ala Val Ser Arg Asp Gln Ile Asp Arg Phe Ser Lys
130 135 140

Glu Asn Gly Phe Thr Gly Trp Thr Glu Thr Ser Val Lys Glu Asn Lys
145 150 155 160

Asn Ile Asn Glu Ala Met Arg Val Leu Ile Glu Lys Met Met Arg Asn
165 170 175

Ser Thr Glu Asp Ile Met Ser Leu Ser Thr Gln Gly Asp Tyr Ile Asn
180 185 190

Leu Gln Thr Lys Ser Ser Ser Trp Ser Cys Cys Xaa
195 200



(2) INFORMATION FOR SEQ ID NO: 357:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:

Met Ile Ser Leu Ile Phe Gln Leu Glu Glu Glu Lys Leu Val Glu Lys
1 5 10 15

Phe Phe Phe Phe Leu Phe Phe Phe Leu Lys Lys Gly Ser Gln Gly Ser
20 25 30

Asn Leu Lys Ile Val Pro Arg His Met Arg Val Val Leu Arg Gly
35 40 45



(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 73 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

Met Thr Tyr Val Thr Cys Leu His Val Cys Leu Leu Val Glu Phe Leu
1 5 10 15

Asn Ser Gln Leu Thr Asn His Arg Lys Tyr Tyr Phe Leu Ser Tyr Gly
20 25 30

Phe Trp Phe Thr Gly Leu Arg Gly Phe Ser Glu Tyr Leu Trp Pro Gln
35 40 45

Gln His Thr Ser Phe His Pro Asn Arg Asn Glu Ile Asn Phe Val Ser
50 55 60

Thr Asp Asn Arg Ile Trp Val Thr Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 102 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359:

Met Ser Asp Gln Glu Ala Lys Pro Ser Thr Glu Asp Leu Gly Asp Lys
1 5 10 15

Lys Glu Gly Glu Tyr Ile Lys Leu Lys Val Ile Gly Gln Asp Ser Ser
20 25 30

Glu Ile His Phe Lys Val Lys Met Thr Thr His Leu Lys Lys Leu Lys
35 40 45

Glu Ser Tyr Cys Gln Arg Gln Gly Val Pro Met Asn Ser Leu Arg Phe
50 55 60

Leu Phe Glu Gly Gln Arg Ile Ala Asp Asn His Thr Pro Lys Glu Leu
65 70 75 80

Gly Met Glu Glu Glu Asp Val Ile Glu Val Tyr Gln Glu Gln Thr Gly
85 90 95

Gly His Ser Thr Val Xaa
100



(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360:

Met Gly Phe Pro Gln Trp His Leu Gly Asn His Ala Val Glu Pro Val
1 5 10 15

Thr Ser Ile Leu Leu Leu Phe Leu Leu Met Met Leu Gly Val Arg Gly
20 25 30

Leu Leu Leu Val Gly Leu Val Tyr Leu Val Ser His Leu Ser Gln Arg
35 40 45



(2) INFORMATION FOR SEQ ID NO: 361:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 179 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

Met Ser Ala Glu Val Lys Val Thr Gly Gln Asn Gln Glu Gln Phe Leu
1 5 10 15

Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu Ile His
20 25 30

Gln Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu Leu Leu Asp
35 40 45

Met Pro Asn Val Arg Glu Leu Ala Glu Ser Asp Phe Ala Ser Thr Phe
50 55 60

Arg Leu Leu Thr Val Phe Ala Tyr Gly Thr Tyr Ala Asp Tyr Leu Ala
65 70 75 80

Glu Ala Arg Asn Leu Pro Pro Leu Thr Glu Ala Gln Lys Asn Lys Leu
85 90 95

Arg His Leu Ser Val Val Thr Leu Ala Ala Lys Val Lys Cys Ile Pro
100 105 110

Tyr Ala Val Leu Leu Glu Ala Leu Ala Leu Arg Asn Val Arg Gln Leu
115 120 125

Glu Asp Leu Val Ile Glu Ala Val Tyr Ala Asp Val Leu Arg Gly Ser
130 135 140

Leu Asp Gln Arg Asn Gln Arg Leu Glu Val Asp Tyr Ser Ile Gly Arg
145 150 155 160

Asp Ile Gln Arg Gln Asp Leu Ser Ala Ile Ala Arg Thr Leu Xaa Lys
165 170 175

Asn His Xaa



(2) INFORMATION FOR SEQ ID NO: 362:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:

Met Lys Ser Ser Ser Leu Phe Phe Phe Phe Leu Ala His Phe Ile His
1 5 10 15

Ser His Asp Leu Pro Gly Leu Cys Arg
20 25



(2) INFORMATION FOR SEQ ID NO: 363:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 224 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

Met Lys Phe Ala Ala Ser Gly Xaa Phe Leu His His Met Ala Gly Leu
1 5 10 15

Ser Ser Ser Lys Leu Ser Met Ser Lys Ala Leu Pro Leu Thr Lys Val
20 25 30

Val Gln Asn Asp Ala Tyr Thr Ala Pro Ala Leu Pro Ser Ser Ile Arg
35 40 45

Thr Lys Ala Leu Thr Asn Met Ser Arg Thr Leu Val Asn Lys Glu Glu
50 55 60

Pro Pro Lys Glu Leu Pro Ala Ala Glu Pro Val Leu Ser Pro Leu Glu
65 70 75 80

Gly Thr Lys Met Thr Val Asn Asn Leu His Pro Arg Val Thr Glu Glu
85 90 95

Asp Ile Val Glu Leu Phe Cys Val Cys Gly Ala Leu Lys Arg Ala Arg
100 105 110

Leu Val His Pro Gly Val Ala Glu Val Val Phe Val Lys Lys Asp Asp
115 120 125

Ala Ile Thr Ala Tyr Lys Lys Tyr Asn Asn Arg Cys Leu Asp Gly Gln
130 135 140

Pro Met Lys Cys Asn Leu His Met Asn Gly Asn Val Ile Thr Ser Asp
145 150 155 160

Gln Pro Ile Leu Leu Arg Leu Ser Asp Ser Pro Ser Met Lys Lys Glu
165 170 175

Ser Glu Leu Pro Arg Arg Val Asn Ser Ala Ser Ser Ser Asn Pro Pro
180 185 190

Ala Glu Val Asp Pro Asp Thr Ile Leu Lys Ala Leu Phe Lys Ser Ser
195 200 205

Gly Ala Ser Xaa Thr Thr Gln Pro Thr Glu Phe Lys Ile Lys Leu Xaa
210 215 220



(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 349 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

Met Ser Lys Asn Cys Ile Lys Leu Leu Cys Glu Asp Pro Val Phe Ala
1 5 10 15

Glu Tyr Ile Lys Cys Ile Leu Met Asp Glu Arg Thr Phe Leu Asn Asn
20 25 30

Asn Ile Val Tyr Thr Phe Met Thr His Phe Leu Leu Lys Val Gln Ser
35 40 45

Gln Val Phe Ser Glu Ala Asn Cys Ala Asn Leu Ile Ser Thr Leu Ile
50 55 60

Thr Asn Leu Ile Ser Gln Tyr Gln Asn Leu Gln Ser Asp Phe Ser Asn
65 70 75 80

Arg Val Glu Ile Ser Lys Ala Ser Ala Ser Leu Asn Gly Asp Leu Arg
85 90 95

Ala Leu Ala Leu Leu Leu Ser Val His Thr Pro Lys Gln Leu Asn Pro
100 105 110

Ala Leu Ile Pro Thr Leu Gln Glu Leu Leu Ser Lys Cys Arg Thr Cys
115 120 125

Leu Gln Gln Arg Asn Ser Leu Gln Glu Gln Glu Ala Lys Glu Arg Lys
130 135 140

Thr Lys Asp Asp Glu Gly Ala Thr Pro Ile Lys Arg Arg Arg Val Ser
145 150 155 160

Ser Asp Glu Glu His Thr Val Asp Ser Cys Ile Ser Asp Met Lys Thr
165 170 175

Glu Thr Arg Glu Val Leu Thr Pro Thr Ser Thr Ser Asp Asn Glu Thr
180 185 190

Arg Asp Ser Ser Ile Ile Asp Pro Gly Thr Glu Gln Asp Leu Pro Ser
195 200 205

Pro Glu Asn Ser Ser Val Lys Glu Tyr Arg Met Glu Val Pro Ser Ser
210 215 220

Phe Ser Glu Asp Met Ser Asn Ile Arg Ser Gln His Ala Glu Glu Gln
225 230 235 240

Ser Asn Asn Gly Arg Tyr Asp Asp Cys Lys Glu Phe Lys Asp Leu His
245 250 255

Cys Ser Lys Asp Ser Thr Leu Ala Glu Glu Glu Ser Glu Phe Pro Ser
260 265 270

Thr Ser Ile Ser Ala Val Leu Ser Asp Leu Ala Asp Leu Arg Ser Cys
275 280 285

Asp Gly Gln Ala Leu Pro Ser Gln Asp Pro Glu Val Ala Leu Ser Leu
290 295 300

Ser Cys Gly His Ser Arg Gly Leu Phe Ser His Met Gln Gln His Asp
305 310 315 320

Ile Leu Asp Thr Leu Cys Arg Thr Ile Glu Ser Thr Ile His Val Val
325 330 335

Thr Arg Ile Ser Gly Lys Gly Asn Gln Ala Ala Ser Xaa
340 345



(2) INFORMATION FOR SEQ ID NO: 365:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 467 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

Met Leu His Gln Asp His Ile Thr Phe Ala Met Leu Leu Ala Arg Ile
1 5 10 15

Lys Leu Lys Gly Thr Val Gly Glu Pro Thr Tyr Asp Ala Glu Phe Gln
20 25 30

His Phe Leu Arg Gly Asn Glu Ile Val Leu Ser Ala Gly Ser Thr Pro
35 40 45

Arg Ile Gln Gly Leu Thr Val Glu Gln Ala Glu Ala Val Val Arg Leu
50 55 60

Ser Cys Leu Pro Ala Phe Lys Asp Leu Ile Ala Lys Val Gln Ala Asp
65 70 75 80

Glu Gln Phe Gly Ile Trp Leu Asp Ser Ser Ser Pro Glu Gln Thr Val
85 90 95

Pro Tyr Leu Trp Ser Glu Glu Thr Pro Ala Thr Pro Ile Gly Gln Ala
100 105 110

Ile His Arg Leu Leu Leu Ile Gln Ala Phe Arg Pro Asp Arg Leu Leu
115 120 125

Ala Met Ala His Met Phe Val Ser Thr Asn Leu Gly Glu Ser Phe Met
130 135 140

Ser Ile Met Glu Gln Pro Leu Asp Leu Thr His Ile Val Xaa Thr Glu
145 150 155 160

Val Lys Pro Asn Thr Pro Val Leu Met Cys Ser Val Pro Gly Tyr Asp
165 170 175

Ala Ser Gly His Val Glu Asp Leu Ala Ala Glu Gln Asn Thr Gln Ile
180 185 190

Thr Ser Ile Ala Ile Gly Ser Ala Glu Gly Phe Asn Gln Ala Asp Lys
195 200 205

Ala Ile Asn Thr Ala Val Lys Ser Gly Arg Trp Val Met Leu Lys Asn
210 215 220

Val His Leu Ala Pro Gly Trp Leu Met Gln Leu Glu Lys Lys Leu His
225 230 235 240

Ser Leu Gln Pro His Ala Cys Phe Arg Leu Phe Leu Thr Met Glu Ile
245 250 255

Asn Pro Lys Val Pro Val Asn Leu Leu Arg Ala Gly Arg Ile Phe Val
260 265 270

Phe Glu Pro Pro Pro Gly Xaa Lys Ala Asn Met Leu Arg Thr Phe Ser
275 280 285

Ser Ile Pro Val Ser Arg Ile Cys Lys Ser Pro Asn Glu Arg Ala Arg
290 295 300

Leu Tyr Phe Leu Leu Ala Trp Phe His Ala Ile Ile Gln Glu Arg Leu
305 310 315 320

Arg Tyr Ala Pro Leu Gly Trp Ser Lys Lys Tyr Glu Phe Gly Glu Ser
325 330 335

Asp Leu Arg Ser Xaa Cys Asp Thr Val Asp Thr Trp Leu Asp Asp Thr
340 345 350

Ala Lys Gly Arg Gln Asn Ile Ser Pro Asp Lys Ile Pro Trp Ser Ala
355 360 365

Leu Lys Thr Leu Met Ala Gln Ser Ile Tyr Gly Gly Arg Val Asp Asn
370 375 380

Glu Phe Asp Gln Arg Leu Leu Asn Thr Phe Leu Glu Arg Leu Phe Thr
385 390 395 400

Thr Arg Ser Phe Asp Ser Glu Phe Lys Leu Ala Cys Lys Val Asp Gly
405 410 415

His Lys Asp Ile Gln Met Pro Asp Gly Met Gln Ala Arg Gly Val Cys
420 425 430

Ala Val Gly Gly Val Ala Pro Arg His Pro Asp Ala Leu Leu Ala Gly
435 440 445

Pro Ala Gln Gln Arg Arg Glu Ser Pro Pro Tyr His Thr Gly Cys Gly
450 455 460

His Asp Gln
465



(2) INFORMATION FOR SEQ ID NO: 366:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366:

Met Ala Asp Glu Ala Thr Arg Arg Val Val Ser Glu Ile Pro Val Leu
1 5 10 15

Lys Thr Asn Ala Gly Pro Arg Asp Arg Glu Leu Trp Val Gln Arg Leu
20 25 30

Lys Glu Glu Tyr Gln Ser Leu Ile Arg Tyr Val Glu Asn Asn Lys Asn
35 40 45

Ala Asp Asn Asp Trp Phe Arg Leu Glu Ser Asn Lys Glu Gly Thr Arg
50 55 60

Trp Phe Gly Lys Cys Trp Tyr Ile His Asp Leu Leu Lys Tyr Glu Phe
65 70 75 80

Asp Ile Glu Phe Asp Ile Pro Ile Thr Tyr Pro Thr Thr Ala Pro Glu
85 90 95

Ile Ala Val Pro Glu Leu Asp Gly Lys Thr Ala Lys Met Tyr Arg Gly
100 105 110

Gly Lys Ile Cys Leu Thr Asp His Phe Lys Pro Leu Trp Gly Gln Glu
115 120 125

Cys Ala Gln Ile Trp Thr Ser Ser Ser His Gly Ser Gly Ala Gly Ser
130 135 140

Met Xaa Gly Ser Gly Asn Pro Xaa
145 150



(2) INFORMATION FOR SEQ ID NO: 367:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 373 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:

Met Tyr Asp Gly Thr Lys Glu Val Pro Met Asn Pro Val Lys Ile Tyr
1 5 10 15

Gln Val Cys Asp Ile Pro Gln Pro Gln Gly Ser Ile Ile Asn Pro Gly
20 25 30

Ser Thr Gly Ser Ala Pro Trp Asp Glu Lys Asp Asn Asp Val Asp Glu
35 40 45

Glu Asp Glu Glu Asp Glu Leu Asp Gln Ser Gln His His Val Pro Ile
50 55 60

Gln Asp Thr Phe Pro Phe Leu Asn Ile Asn Gly Ser Pro Met Ala Pro
65 70 75 80

Ala Ser Val Gly Asn Cys Ser Val Gly Asn Cys Ser Pro Glu Ala Val
85 90 95

Trp Pro Lys Thr Glu Pro Leu Glu Met Glu Val Pro Gln Ala Pro Ile
100 105 110

Gln Pro Phe Tyr Ser Ser Pro Glu Leu Trp Ile Ser Ser Leu Pro Met
115 120 125

Thr Asp Leu Asp Ile Lys Phe Gln Tyr Arg Gly Lys Glu Tyr Gly Gln
130 135 140

Thr Met Thr Val Ser Asn Pro Gln Gly Cys Arg Leu Phe Tyr Gly Asp
145 150 155 160

Leu Gly Pro Met Pro Asp Gln Glu Glu Leu Phe Gly Pro Val Xaa Leu
165 170 175

Glu Gln Val Lys Phe Pro Gly Pro Glu His Ile Thr Asn Glu Lys Gln
180 185 190

Lys Leu Phe Thr Ser Lys Leu Leu Asp Val Met Asp Arg Gly Leu Ile
195 200 205

Leu Glu Val Ser Gly His Ala Ile Tyr Ala Ile Arg Leu Cys Gln Cys
210 215 220

Lys Val Tyr Trp Ser Gly Pro Cys Ala Pro Ser Leu Val Ala Pro Asn
225 230 235 240

Leu Ile Glu Arg Gln Lys Lys Val Lys Leu Phe Cys Leu Glu Thr Phe
245 250 255

Leu Ser Asp Leu Ile Ala His Gln Lys Gly Gln Ile Glu Lys Gln Pro
260 265 270

Pro Phe Glu Ile Tyr Leu Cys Phe Gly Glu Glu Trp Pro Asp Gly Lys
275 280 285

Pro Leu Glu Arg Lys Leu Ile Leu Val Gln Val Ile Pro Val Val Ala
290 295 300

Arg Met Ile Tyr Glu Met Phe Ser Gly Asp Phe Thr Arg Ser Phe Asp
305 310 315 320

Ser Gly Ser Val Arg Leu Gln Ile Ser Thr Pro Asp Ile Lys Asp Asn
325 330 335

Ile Val Ala Gln Leu Lys Gln Leu Tyr Arg Ile Leu Gln Thr Gln Glu
340 345 350

Ser Trp Gln Pro Met Gln Pro Thr Pro Ser Met Gln Leu Pro Pro Ala
355 360 365

Leu Pro Pro Gln Xaa
370



(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 83 amino acids
(B) TYPE: amino acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

Met Gly Ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser
1 5 10 15

Leu Gly Gly Arg Cys Ile Gln Gly Arg Phe Ala Ser His Ser Lys Phe
20 25 30

Trp Gly Phe Gly Arg Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr
35 40 45

Pro Pro Asp Gln Glu Pro Gln Lys Glu Thr Glu Pro Ala Thr Ser Ser
50 55 60

His Ala Arg Pro Trp Ala Arg Val Ile Gly Leu Arg Ile Trp Pro Gln
65 70 75 80

Pro Asn Xaa



(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

Met Leu Leu Ser Val Ala Ile Phe Ile Leu Leu Thr Leu Val Tyr Ala
1 5 10 15

Tyr Trp Thr Met Xaa
20



(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 227 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

Met Gly Ala Ser Ala Arg Leu Leu Arg Ala Val Ile Met Gly Ala Pro
1 5 10 15

Gly Ser Gly Lys Gly Thr Val Ser Ser Arg Ile Thr Thr His Phe Glu
20 25 30

Leu Lys His Leu Ser Ser Gly Asp Leu Leu Arg Asp Asn Met Leu Arg
35 40 45

Gly Thr Glu Ile Gly Val Leu Ala Lys Ala Phe Ile Asp Gln Gly Lys
50 55 60

Leu Ile Pro Asp Asp Val Met Thr Arg Leu Ala Leu His Glu Leu Lys
65 70 75 80

Asn Leu Thr Gln Tyr Ser Trp Leu Leu Asp Gly Phe Pro Arg Thr Leu
85 90 95

Pro Gln Ala Glu Ala Leu Asp Arg Ala Tyr Gln Ile Asp Thr Val Ile
100 105 110

Asn Leu Asn Val Pro Phe Glu Val Ile Lys Gln Arg Leu Thr Ala Arg
115 120 125

Trp Ile His Pro Ala Ser Gly Arg Val Tyr Asn Ile Glu Phe Asn Pro
130 135 140

Pro Lys Thr Val Gly Ile Asp Asp Leu Thr Gly Glu Pro Leu Ile Gln
145 150 155 160

Arg Glu Asp Asp Lys Pro Glu Thr Val Ile Lys Arg Leu Lys Ala Tyr
165 170 175

Glu Asp Gln Thr Lys Pro Val Leu Glu Tyr Tyr Gln Lys Lys Gly Val
180 185 190

Leu Glu Thr Phe Ser Gly Thr Glu Thr Asn Lys Ile Trp Pro Tyr Val
195 200 205

Tyr Ala Phe Leu Gln Thr Lys Val Pro Gln Arg Ser Gln Lys Ala Ser
210 215 220

Val Thr Pro
225



(2) INFORMATION FOR SEQ ID NO: 371:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

Met Phe Leu Asn Cys Glu Ile Leu Glu Tyr Cys Tyr Tyr Leu Thr Gln
1 5 10 15

Leu Lys Ile Ser Met Gly Lys Tyr Leu Ser Ile Pro Thr Val Leu Leu
20 25 30

Lys Ile Ile Arg Cys Ser Ile Thr Ala Val Ser Asp Ser Ser Thr Ser
35 40 45

Trp Ala Ile Lys Ala Gln Leu Lys Ile Glu Asn Lys Asp Leu Asp Asn
50 55 60

Lys Thr Ala Lys Gly Gly Gly Gln Glu Ala Leu Thr Cys Thr Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 372:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

Met Arg Ala Val Phe Pro Cys Cys Pro Phe Leu Thr Leu Met Leu Pro
1 5 10 15

Leu Leu Glu Cys Leu Val Gly Met Ile Met Cys Tyr Leu Gly Ile Ser
20 25 30

Phe Thr Asp Thr Arg Lys Thr Ala Gly Leu Lys Lys Lys Lys Lys Lys
35 40 45

Lys Xaa Xaa
50



(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

Met Phe Leu Met Arg Met His Leu Cys Phe Cys Lys Tyr Cys Cys Ser
1 5 10 15

Phe Ile Val Thr Pro Thr Ser Thr Ser Asn Thr Ala Ser Tyr Leu Trp
20 25 30

Pro Trp Ile Ser Ala Ser Met Ala Gly Arg Gly Ser Ser Trp Ala Cys
35 40 45

Thr Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 374:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe
1 5 10 15

Thr Leu Ser Ile Asn Gln Glu Lys Leu Ala Asn His Leu Ala Phe Arg
20 25 30

Ile Leu Phe Phe Ile Val Phe Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 375:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 44 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:

Met Cys Ser Gly Gln Ser Gln Val Trp Lys Met Ala Leu Gln Ala Leu
1 5 10 15

Asp Ser Glu Thr Val Val Ile Leu Pro Asp Met His Leu Ile Leu Ser
20 25 30

Leu Arg Leu Ile His Asn Ala Arg Pro Cys Leu Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 376:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 203 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376:

Met Leu Ile Ser Glu Glu Glu Ile Pro Phe Lys Asp Asp Pro Arg Asp
1 5 10 15

Glu Thr Tyr Lys Pro His Leu Glu Arg Glu Thr Pro Lys Pro Arg Arg
20 25 30

Lys Ser Gly Lys Val Lys Glu Glu Lys Glu Lys Lys Glu Ile Lys Val
35 40 45

Glu Val Glu Val Glu Val Lys Glu Glu Glu Asn Glu Ile Arg Glu Asp
50 55 60

Glu Glu Pro Pro Arg Lys Arg Gly Arg Arg Arg Lys Asp Asp Lys Ser
65 70 75 80

Pro Arg Leu Pro Lys Arg Arg Lys Lys Pro Pro Ile Gln Tyr Val Arg
85 90 95

Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His Pro Arg Tyr Leu
100 105 110

Gln His His Ile Lys Tyr Gln His Leu Leu Lys Lys Lys Tyr Val Cys
115 120 125

Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gln Lys Gln Leu Leu
130 135 140

Arg His Ala Lys His His Thr Asp Gln Arg Asp Tyr Ile Cys Glu Tyr
145 150 155 160

Cys Ala Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met
165 170 175

Ile His Thr Gly Glu Lys His Tyr Asn Val Arg Ser Val Asp Leu Leu
180 185 190

Val Asp Lys Arg His Leu Leu Ile Gly Thr Xaa
195 200

(2) INFORMATION FOR SEQ ID NO: 377:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:

Met Leu Pro Arg Arg Thr Phe Tyr Phe Tyr Phe Ile Phe Ile Phe Phe
1 5 10 15

Leu Ala Ser Phe Trp Gly Phe Thr Leu Arg Ala Ser Phe
20 25



(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 136 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378:

Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu Met
1 5 10 15

Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gln His Ala Val Val Leu
20 25 30

Tyr Leu His Asn His Ser Tyr Ala Ser Ile Gln Ser Asp Asp Leu Trp
35 40 45

Asp Ser Phe Asn Glu Val Thr Asn Gln Thr Leu Asp Val Lys Arg Met
50 55 60

Met Lys Thr Trp Thr Leu Gln Lys Gly Phe Pro Leu Val Thr Val Gln
65 70 75 80

Lys Lys Gly Lys Glu Leu Phe Ile Gln Gln Glu Arg Phe Phe Leu Asn
85 90 95

Met Lys Pro Glu Ile Gln Pro Ser Asp Thr Arg Tyr Met Pro Ser Phe
100 105 110

Phe Ser Cys His Leu Phe Cys Thr Leu Arg Trp Lys Tyr Phe Glu Val
115 120 125

Phe Tyr Asn His Lys Phe Leu Xaa
130 135



(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:

Met Ala Trp Arg Arg Arg Glu Pro Ala Ser Gly Leu Ala Ala Cys Trp
1 5 10 15

Leu Trp Arg Cys Ser Pro Trp Pro Cys Ala Cys Pro Gly Pro Gly Ala
20 25 30

Gly Leu Ser Ser Gly Ser Arg Pro Trp
35 40



(2) INFORMATION FOR SEQ ID NO: 380: 
(i) SEQUENCE CHARACTERISTICS:



SDOCID: <WO_983944aA2JA> 






(A) LENGTH: 468 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

Met Glu Phe Leu Lys Val Ala Arg Arg Asn Lys Arg Glu Gln Leu Glu
1 5 10 15

Gln Ile Gln Lys Glu Leu Ser Val Leu Glu Glu Asp Ile Lys Arg Val
20 25 30

Glu Glu Met Ser Gly Leu Tyr Ser Pro Val Ser Glu Asp Ser Thr Val
35 40 45

Pro Gln Phe Glu Ala Pro Ser Pro Ser His Ser Ser Ile Ile Asp Ser
50 55 60

Thr Glu Tyr Ser Gln Pro Pro Gly Phe Ser Gly Ser Ser Gln Thr Lys
65 70 75 80

Lys Gln Pro Trp Tyr Asn Ser Thr Leu Ala Ser Arg Arg Lys Arg Leu
85 90 95

Thr Ala His Phe Glu Asp Leu Glu Gln Cys Tyr Phe Ser Thr Arg Met
100 105 110

Ser Arg Ile Ser Asp Asp Ser Arg Thr Ala Ser Gln Leu Asp Glu Phe
115 120 125

Gln Glu Cys Leu Ser Lys Phe Thr Arg Tyr Asn Ser Val Arg Pro Leu
130 135 140

Ala Thr Leu Ser Tyr Ala Ser Asp Leu Tyr Asn Gly Ser Ser Ile Val
145 150 155 160

Ser Ser Ile Glu Phe Asp Arg Asp Cys Asp Tyr Phe Ala Ile Ala Gly
165 170 175

Val Thr Lys Lys Ile Lys Val Tyr Glu Tyr Asp Thr Val Ile Gln Asp
180 185 190

Ala Val Asp Ile His Tyr Pro Glu Asn Glu Met Thr Cys Asn Ser Lys
195 200 205

Ile Ser Cys Ile Ser Trp Ser Ser Tyr His Lys Asn Leu Leu Ala Ser
210 215 220

Ser Asp Tyr Glu Gly Thr Val Ile Leu Trp Asp Gly Phe Thr Gly Gln
225 230 235 240

Arg Ser Lys Val Tyr Gln Glu His Glu Lys Arg Cys Trp Ser Val Asp
245 250 255

Phe Asn Leu Met Asp Pro Lys Leu Leu Ala Ser Gly Ser Asp Asp Ala
260 265 270

Lys Val Lys Leu Trp Ser Thr Asn Leu Asp Asn Ser Val Ala Ser Ile
275 280 285

Glu Ala Lys Ala Asn Val Cys Cys Val Lys Phe Ser Pro Ser Ser Arg
290 295 300

Tyr His Leu Ala Phe Gly Cys Ala Asp His Cys Val His Tyr Tyr Asp
305 310 315 320

Leu Arg Asn Thr Lys Gln Pro Ile Met Val Phe Lys Gly His Arg Lys
325 330 335

Ala Val Ser Tyr Ala Lys Phe Val Ser Gly Glu Glu Ile Val Ser Ala
340 345 350

Ser Thr Asp Ser Gln Leu Lys Leu Trp Asn Val Gly Lys Pro Tyr Cys
355 360 365

Leu Arg Ser Phe Lys Gly His Ile Asn Glu Lys Asn Phe Val Gly Leu
370 375 380

Ala Ser Asn Gly Asp Tyr Ile Ala Cys Gly Ser Glu Asn Asn Ser Leu
385 390 395 400

Tyr Leu Tyr Tyr Lys Gly Leu Ser Lys Thr Leu Leu Thr Phe Lys Phe
405 410 415

Asp Thr Val Lys Ser Val Leu Asp Lys Asp Arg Lys Glu Asp Asp Thr
420 425 430

Asn Glu Phe Val Ser Ala Val Cys Trp Arg Ala Leu Pro Asp Gly Glu
435 440 445

Ser Asn Val Leu Ile Ala Ala Asn Ser Gln Gly Thr Ile Lys Val Leu
450 455 460

Glu Leu Val Xaa
465



(2) INFORMATION FOR SEQ ID NO: 381:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:

Met Arg Lys Glu Asp Gly Phe Trp Phe Phe Phe Phe Leu Phe Phe Phe
1 5 10 15

Val Val Gly Ser Lys Phe Val Asn Gly Asn Lys Leu Val
20 25



(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

Met Pro Leu Ala Pro Tyr Cys Asp Leu Leu Val Ala Leu Ser Phe Ala
1 5 10 15

Leu Val Leu Glu Ser Pro Val Asp Ser Ser Asp Phe Thr
20 25



(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

Met Asn Ser Leu Val Ser Trp Gln Leu Leu Leu Phe Leu Cys Ala Thr
1 5 10 15

His Phe Gly Glu Pro Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg
20 25 30

Pro Thr Gly Gln Gln Leu Glu Ser Leu Gly Leu Leu Ala Pro Gly Glu
35 40 45

Gln Ser Leu Pro Cys Thr Glu Arg Lys Pro Ala Ala Thr Ala Arg Leu
50 55 60

Ser Arg Arg Gly Thr Ser Leu Ser Pro Pro Pro Glu Ser Ser Gly Ser
65 70 75 80

Pro Gln Gln Pro Gly Leu Ser Ala Pro His Ser Arg Gln Ile Pro Ala
85 90 95

Pro Gln Gly Ala Val Leu Val Gln Arg Glu Lys Asp Leu Pro Asn Tyr
100 105 110



Asn Trp Asn Ser Phe Gly Leu Arg Phe Gly Lys Arg Glu Ala Ala Pro
115 120 125

Gly Asn His Gly Arg Ser Ala Gly Arg Gly
130 135



(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 74 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

Met Ser Cys Phe Ile Asp Ser Xaa Asp Ser Lys Ile Leu His Leu Leu
1 5 10 15

Val Val Ser Phe Ile Cys Xaa Leu Phe Leu Leu Ile Leu Thr His Gly
20 25 30

Ile Leu Ile Leu Arg Xaa Phe Phe Ser Val Xaa Xaa His Ser Leu Lys
35 40 45

Asn Asn Leu Glu Glu Tyr Leu Ile Leu Met Asn Lys Ala Leu Leu Thr
50 55 60

Arg Glu Asp Phe Phe Val Leu Pro Xaa Ala
65 70



(2) INFORMATION FOR SEQ ID NO: 385:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 521 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385:

Met Ser Ala Gly Glu Val Glu Arg Leu Val Ser Glu Leu Ser Gly Gly
1 5 10 15

Thr Gly Gly Asp Glu Glu Glu Glu Trp Leu Tyr Gly Asp Glu Asn Glu
20 25 30

Val Glu Arg Pro Glu Glu Glu Asn Ala Ser Ala Asn Pro Pro Ser Gly
35 40 45

Ile Glu Asp Glu Thr Ala Glu Asn Gly Val Pro Lys Pro Lys Val Thr
50 55 60

Glu Thr Glu Asp Asp Ser Asp Ser Asp Ser Asp Asp Asp Glu Asp Asp
65 70 75 80

Val His Val Thr Ile Gly Asp Ile Lys Thr Gly Ala Pro Gln Tyr Gly
85 90 95

Ser Tyr Gly Thr Ala Pro Val Asn Leu Asn Ile Lys Thr Gly Gly Arg
100 105 110

Val Tyr Gly Thr Thr Gly Thr Lys Val Lys Gly Val Asp Leu Asp Ala
115 120 125

Pro Gly Ser Ile Asn Gly Val Pro Leu Leu Glu Val Asp Leu Asp Ser
130 135 140

Phe Glu Asp Lys Pro Trp Arg Lys Pro Gly Ala Asp Leu Ser Asp Tyr
145 150 155 160

Phe Asn Tyr Gly Phe Asn Glu Asp Thr Trp Lys Ala Tyr Cys Glu Lys
165 170 175

Gln Lys Arg Ile Arg Met Gly Leu Glu Val Ile Pro Val Thr Ser Thr
180 185 190

Thr Asn Lys Ile Thr Val Gln Gln Gly Arg Thr Gly Asn Ser Glu Lys
195 200 205

Glu Thr Ala Leu Pro Ser Thr Lys Ala Glu Phe Thr Ser Pro Pro Ser
210 215 220

Leu Phe Lys Thr Gly Leu Pro Pro Ser Arg Arg Leu Pro Gly Ala Ile
225 230 235 240

Asp Val Ile Gly Gln Thr Ile Thr Ile Ser Arg Val Glu Gly Arg Arg
245 250 255

Arg Ala Asn Glu Asn Ser Asn Ile Gln Val Leu Ser Glu Arg Ser Ala
260 265 270

Thr Glu Val Asp Asn Asn Phe Ser Lys Pro Pro Pro Phe Phe Pro Pro
275 280 285

Gly Ala Pro Pro Thr His Leu Pro Pro Pro Pro Phe Leu Pro Pro Pro
290 295 300

Pro Thr Val Ser Thr Ala Pro Pro Leu Ile Pro Pro Pro Gly Phe Pro
305 310 315 320

Pro Pro Pro Gly Ala Pro Pro Pro Ser Leu Ile Pro Thr Ile Glu Ser
325 330 335

Gly His Ser Ser Gly Tyr Asp Ser Arg Ser Ala Arg Ala Phe Pro Tyr
340 345 350

Gly Asn Val Ala Phe Pro His Leu Pro Gly Ser Ala Pro Ser Trp Pro
355 360 365

Ser Leu Val Asp Thr Ser Lys Gln Trp Asp Tyr Tyr Ala Arg Arg Glu
370 375 380

Lys Asp Arg Asp Arg Glu Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp
385 390 395 400

Arg Asp Arg Glu Arg Glu Arg Thr Arg Glu Arg Glu Arg Glu Arg Asp
405 410 415

His Ser Pro Thr Pro Ser Val Phe Asn Ser Asp Glu Glu Arg Tyr Arg
420 425 430

Tyr Arg Glu Tyr Ala Glu Arg Gly Tyr Glu Arg His Arg Ala Ser Arg
435 440 445

Glu Lys Glu Glu Arg His Arg Glu Arg Arg His Arg Glu Lys Glu Glu
450 455 460

Thr Arg His Lys Ser Ser Arg Ser Asn Ser Arg Arg Arg His Glu Ser
465 470 475 480

Glu Glu Gly Asp Ser His Arg Arg His Lys His Lys Lys Ser Lys Arg
485 490 495

Ser Lys Glu Gly Lys Glu Ala Gly Ser Glu Pro Ala Pro Glu Gln Glu
500 505 510

Ser Thr Glu Ala Thr Pro Ala Glu Xaa
515 520



(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 137 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

Met Asn Ser Arg Gly Ile Trp Leu Ala Tyr Ile Ile Leu Val Gly Leu
1 5 10 15

Leu His Met Val Leu Leu Ser Ile Pro Phe Phe Ser Ile Pro Val Val
20 25 30

Trp Thr Leu Thr Asn Val Ile His Asn Leu Ala Thr Tyr Val Phe Leu
35 40 45

His Thr Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala
50 55 60

Arg Leu Leu Thr His Trp Glu Gln Met Asp Tyr Gly Leu Gln Phe Thr
65 70 75 80

Ser Ser Arg Lys Phe Leu Ser Ile Ser Pro Ile Val Leu Tyr Leu Leu
85 90 95

Ala Ser Phe Tyr Thr Lys Tyr Asp Ala Ala His Phe Leu Ile Asn Thr
100 105 110

Ala Ser Leu Leu Ser Val Leu Leu Pro Lys Leu Pro Gln Phe His Gly
115 120 125

Val Arg Val Phe Gly Ile Asn Lys Tyr
130 135



(2) INFORMATION FOR SEQ ID NO: 387:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 186 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:

Met Ala Ala Gln Lys Asp Gln Gln Lys Asp Ala Glu Ala Glu Gly Leu
1 5 10 15

Ser Gly Thr Thr Leu Leu Pro Lys Leu Ile Pro Ser Gly Ala Gly Arg
20 25 30

Glu Trp Leu Glu Arg Arg Arg Ala Thr Ile Arg Pro Trp Ser Thr Phe
35 40 45

Val Asp Gln Gln Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys
50 55 60

Gln Arg Leu Val Arg Asn Val Glu Tyr Tyr Gln Ser Asn Tyr Val Phe
65 70 75 80

Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val Thr Ser Pro Met Leu
85 90 95

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr Ile Leu Tyr Leu
100 105 110

Arg Thr Leu Glu Ser Lys Leu Val Leu Phe Gly Arg Glu Val Ser Pro
115 120 125

Ala His Gln Tyr Ala Leu Ala Gly Gly Ile Ser Phe Pro Phe Phe Trp
130 135 140

Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr Leu
145 150 155 160

Val Val Ile Gly Ser His Ala Ala Phe His Gln Ile Glu Ala Val Asp
165 170 175

Gly Glu Glu Leu Gln Met Glu Pro Val Xaa
180 185



(2) INFORMATION FOR SEQ ID NO: 388:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388:

Met
1



(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 299 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

Met Leu Ser Ile Phe Tyr Phe Ala Ile Pro Val Gly Ser Gly Leu Gly
1 5 10 15

Tyr Ile Ala Gly Ser Lys Val Lys Asp Met Ala Gly Asp Trp His Trp
20 25 30

Ala Leu Arg Val Thr Pro Gly Leu Gly Val Val Ala Val Leu Leu Leu
35 40 45

Phe Leu Val Val Arg Glu Pro Pro Arg Gly Ala Val Glu Arg His Ser
50 55 60

Asp Leu Pro Pro Leu Asn Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala
65 70 75 80

Leu Ala Arg Asn Pro Ser Phe Val Leu Ser Ser Leu Gly Phe Thr Ala
85 90 95

Val Ala Phe Val Thr Gly Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu
100 105 110

Leu Arg Ser Arg Val Val Leu Gly Glu Thr Pro Pro Cys Leu Pro Gly
115 120 125

Asp Ser Cys Ser Ser Ser Asp Ser Leu Ile Phe Gly Leu Ile Thr Cys
130 135 140

Leu Thr Gly Val Leu Gly Val Gly Leu Gly Val Glu Ile Ser Arg Arg
145 150 155 160

Leu Arg His Ser Asn Pro Arg Ala Asp Pro Leu Val Cys Ala Thr Gly
165 170 175

Leu Leu Gly Ser Ala Pro Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg
180 185 190

Gly Ser Ile Val Ala Thr Tyr Ile Phe Ile Phe Ile Gly Glu Thr Leu
195 200 205

Leu Ser Met Asn Trp Ala Ile Val Ala Asp Ile Leu Leu Tyr Val Val
210 215 220

Ile Pro Thr Arg Arg Ser Thr Ala Glu Ala Phe Gln Ile Val Leu Ser
225 230 235 240

His Leu Leu Gly Asp Ala Gly Ser Pro Tyr Leu Ile Gly Leu Ile Ser
245 250 255

Asp Arg Leu Arg Arg Asn Trp Pro Pro Ser Phe Leu Ser Glu Phe Arg
260 265 270

Ala Leu Gln Phe Ser Leu Met Leu Cys Ala Phe Val Gly Ala Leu Gly
275 280 285

Gly Ala Leu Pro Gly His Arg His Leu His Xaa
290 295



(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:

Met Gly Pro Gln Gly Trp Val Arg Pro Leu Lys Thr Ala Pro Lys Leu
1 5 10 15

Gly Glu Ala Ile Arg Leu Ile Leu Phe Leu Asn Phe Val Lys Gln Cys
20 25 30

Ile Ala Ser Val Asn Leu Cys Ile Leu Arg Leu Asn Ile Thr Pro Leu
35 40 45

Leu

(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

Met Tyr Val Asn Tyr Gly Thr Arg Asn Tyr Ser Thr Glu Gly Pro Ala
1 5 10 15

Ala Leu Leu Asp Gln Ala Lys Leu Ser Leu Leu Val Trp Val Leu Cys
20 25 30

Phe Val Leu Leu Phe Val Cys Phe Cys Gly Leu Ser Tyr Val Val Ile
35 40 45

Ala Gln Val Pro Val Gly Leu Leu Cys Ile Thr Glu Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 392:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 79 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:

Me^t_Leu_Trp_.Ehe_Ala_Asn JPiie_i>he.Jrhr_Tyr_Leu Ser_Gln_Ser . , 

1 5 10 15

Val Ala Phe Val His Ile Ser His Ile Gly Val Arg Gln Val Asn Thr
20 25 30

Asn Cys Tyr Phe Ser Arg Lys Ser Tyr Cys Tyr Gly Ile Leu Asn Pro
35 40 45

Ile Asn Cys Ile Lys Gly Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys
50 55 60

Lys Lys Lys Lys Ile Pro Ala Gly Arg Xaa Leu Phe Pro Phe Gly
65 70 75



(2) INFORMATION FOR SEQ ID NO: 393:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393:

Met Pro Gly Ala Phe Ser Glu Thr Val Ile Asn Asp Leu Leu Ser Leu
1 5 10 15

Phe Leu Val Leu Pro Ala Glu Leu Ser Tyr Ser Thr Leu Ser Gly Val
20 25 30

Tyr Arg Asn Ala
35



(2) INFORMATION FOR SEQ ID NO: 394:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:

Met Ala Gln Ser Arg Asp Gly Gly Asn Pro Phe Ala Glu Pro Ser Glu
1 5 10 15

Leu Asp Asn Pro Phe Gln Asp Pro Ala Val Ile Gln His Arg Pro Ser
20 25 30

Arg Gln Tyr Ala Thr Leu Asp Val Tyr Asn Pro Phe Glu Thr Arg Glu
35 40 45

Pro Pro Pro Ala Tyr Glu Pro Pro Ala Pro Ala Pro Leu Pro Pro Pro
50 55 60

Ser Ala Pro Ser Leu Gln Pro Ser Arg Lys Leu Ser Pro Thr Glu Pro
65 70 75 80

Lys Asn Tyr Gly Ser Tyr Ser Thr Gln Ala Ser Ala Ala Ala Ala Thr
85 90 95

Ala Glu Leu Leu Lys Lys Gln Glu Glu Leu Asn Arg Lys Ala Glu Glu
100 105 110

Leu Asp Arg Arg Ser Glu Ser Cys Ser Met Leu Pro Trp Xaa Ala Gln
115 120 125

Leu Leu Asp Arg Thr Ile Gly Pro Leu Tyr Leu Leu Phe Val Gln Phe
130 135 140

Ser Pro Ala Phe Ser Arg Thr Ser Pro Trp Arg Ser Pro Lys Asn Phe
145 150 155 160

Arg Arg Leu Tyr Pro Pro Cys Thr Thr Ser Gly Cys Ala Ala Arg Trp
165 170 175

Xaa Phe Ser Xaa
180



(2) INFORMATION FOR SEQ ID NO: 395:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:

Met Pro Thr Pro Cys Thr Ser Leu Pro Ser Cys Cys Gln His Arg Ser
1 5 10 15

Ile Thr Met Thr Leu
20



(2) INFORMATION FOR SEQ ID NO: 396:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 60 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

Met Pro Leu Phe Ile Pro Leu Ile Phe Phe Leu Ser Leu Leu His Cys
1 5 10 15

Gln Ser Lys His Pro Ile Gln Met Ser Leu Cys Met Cys Val Asn Ile
20 25 30

Ser Leu Val Trp Ser Pro Val Arg Trp Ile Phe Gly Ser Lys Gly Leu
35 40 45

Phe Ser Val His Leu Gln Ser Ser Gln Arg Pro Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 397:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 152 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

Met Ala Gly Pro Arg Pro Xaa Trp Arg Asp Gln Leu Leu Phe Met Ser
1 5 10 15

Ile Ile Val Leu Val Ile Val Val Ile Cys Leu Met Leu Tyr Ala Leu
20 25 30

Leu Trp Glu Ala Gly Asn Leu Thr Asp Leu Pro Asn Leu Arg Ile Gly
35 40 45

Phe Tyr Asn Phe Cys Leu Trp Asn Glu Asp Thr Ser Thr Leu Gln Cys
50 55 60

His Gln Phe Pro Glu Leu Glu Ala Leu Gly Val Pro Arg Val Gly Leu
65 70 75 80

Gly Leu Ala Arg Leu Gly Val Tyr Gly Ser Leu Val Leu Thr Leu Phe
85 90 95

Ala Pro Gln Pro Leu Leu Leu Ala Gln Cys Asn Xaa Asp Glu Arg Ala
100 105 110

Trp Arg Leu Ala Val Gly Phe Leu Ala Val Ser Ser Val Leu Leu Ala
115 120 125

Gly Gly Leu Gly Leu Phe Leu Ser Tyr Val Trp Asn Gly Ser Xaa Ser
130 135 140

Pro Ser Arg Gly Leu Gly Phe Xaa
145 150



(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 480 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:

Met Ser Asp Gly Phe Asp Arg Ala Pro Gly Ala Gly Arg Gly Arg Xaa
1 5 10 15

Arg Gly Leu Gly Arg Gly Gly Gly Gly Pro Xaa Gly Gly Gly Phe Pro
20 25 30

Xaa Gly Xaa Xaa Pro Ala Glu Arg Xaa Arg His Gln Pro Pro Gln Pro
35 40 45

Lys Ala Pro Gly Phe Leu Gln Pro Xaa Pro Leu Arg Gln Pro Arg Thr
50 55 60

Thr Pro Pro Pro Gly Ala Gln Cys Glu Val Pro Ala Ser Pro Gln Arg
65 70 75 80

Pro Ser Arg Pro Gly Ala Leu Pro Glu Gln Thr Arg Pro Leu Arg Ala
85 90 95

Pro Pro Ser Ser Gln Asp Lys Ile Pro Gln Gln Asn Ser Glu Ser Ala
100 105 110

Met Ala Lys Pro Gln Val Val Val Ala Pro Val Leu Met Ser Lys Leu
115 120 125

Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr
130 135 140

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr Pro Thr Leu Ser Glu
145 150 155 160

Tyr Val Gln Asp Phe Leu Asn His Leu Thr Glu Gln Pro Gly Ser Phe
165 170 175

Glu Thr Glu Ile Glu Gln Phe Ala Glu Thr Leu Asn Gly Cys Val Thr
180 185 190

Thr Asp Asp Ala Leu Gln Glu Leu Val Glu Leu Ile Tyr Gln Gln Ala
195 200 205

Thr Ser Ile Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr
210 215 220

Leu Ser His His Leu Thr Ile Ser Pro Gln Ser Gly Asn Phe Arg Gln
225 230 235 240

Leu Leu Leu Gln Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gln Ala
245 250 255

Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu
260 265 270

Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu Ile Lys Gly Thr Asn Gly
275 280 285

Gln Val Thr Arg Ala Asp Ile Leu Gln Val Gly Leu Arg Glu Leu Leu
290 295 300

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu Ile Cys Ala Val
305 310 315 320

Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu
325 330 335

Lys Gly Lys Met Asp Met Glu Glu Ile Ile Gln Arg Ile Glu Asn Val
340 345 350

Val Leu Asp Ala Asn Cys Ser Arg Asp Val Lys Gln Met Leu Leu Lys
355 360 365

Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser
370 375 380

Thr Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn
385 390 395 400

Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp
405 410 415

Pro Asp Tyr Gln Glu Lys Tyr Gln Glu Leu Leu Glu Arg Glu Asp Phe
420 425 430

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp
435 440 445

Pro Tyr Leu Asp Asp Ile Asp Asp Glu Met Asp Pro Glu Ile Glu Glu
450 455 460

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gln Xaa
465 470 475 480



(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 423 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399:

Met Glu Pro Lys Thr Ile Thr Asp Ala Leu Ala Ser Ser Ile Ile Lys
1 5 10 15

Ser Val Leu Pro Asn Phe Leu Pro Tyr Asn Val Met Leu Tyr Ser Asp
20 25 30

Ala Pro Val Ser Glu Leu Ser Leu Glu Leu Leu Leu Leu Gln Val Val
35 40 45

Leu Pro Ala Leu Leu Glu Gln Gly His Thr Arg Gln Trp Leu Lys Gly
50 55 60

Leu Val Arg Ala Trp Thr Val Thr Ala Gly Tyr Leu Leu Asp Leu His
65 70 75 80

Ser Tyr Leu Leu Gly Asp Gln Glu Glu Asn Glu Asn Ser Ala Asn Gln
85 90 95

Gln Val Asn Asn Asn Gln His Ala Arg Asn Asn Asn Ala Ile Pro Val
100 105 110

Val Gly Glu Gly Leu His Ala Ala His Gln Ala Ile Leu Gln Gln Gly
115 120 125

Gly Pro Val Gly Phe Gln Xaa Tyr Arg Arg Pro Leu Asn Phe Pro Leu
130 135 140

Arg Ile Phe Leu Leu Ile Val Phe Met Cys Ile Thr Leu Leu Ile Ala
145 150 155 160

Ser Leu Ile Cys Leu Thr Leu Pro Val Phe Ala Gly Arg Trp Leu Met
165 170 175

Ser Phe Trp Thr Gly Thr Ala Lys Ile His Glu Leu Tyr Thr Ala Ala
180 185 190

Cys Gly Leu Tyr Val Cys Trp Leu Thr Ile Arg Ala Val Thr Val Met
195 200 205

Val Ala Trp Met Pro Gln Gly Arg Arg Val Ile Phe Gln Lys Val Lys
210 215 220

Glu Trp Ser Leu Met Ile Met Lys Thr Leu Ile Val Ala Val Leu Leu
225 230 235 240

Ala Gly Val Val Pro Leu Leu Leu Gly Leu Leu Phe Glu Leu Val Ile
245 250 255

Val Ala Pro Leu Arg Val Pro Leu Asp Gln Thr Pro Leu Phe Tyr Pro
260 265 270

Trp Gln Asp Trp Ala Leu Gly Val Leu His Ala Lys Ile Ile Ala Ala
275 280 285

Ile Thr Leu Met Gly Pro Gln Trp Trp Leu Lys Thr Val Ile Glu Gln
290 295 300

Val Tyr Ala Asn Gly Ile Arg Asn Ile Asp Leu His Tyr Ile Val Arg
305 310 315 320

Lys Leu Ala Ala Pro Val Ile Ser Val Leu Leu Leu Ser Leu Cys Val
325 330 335

Pro Tyr Val Ile Ala Ser Gly Val Val Pro Leu Leu Gly Val Thr Ala
340 345 350

Glu Met Gln Asn Leu Val His Arg Arg Ile Tyr Pro Phe Leu Leu Met
355 360 365

Val Val Val Leu Met Ala Ile Leu Ser Phe Gln Val Arg Gln Phe Lys
370 375 380

Arg Leu Tyr Glu His Ile Lys Asn Asp Lys Tyr Leu Val Gly Gln Arg
385 390 395 400

Leu Val Asn Tyr Glu Arg Lys Ser Gly Lys Gln Gly Ser Ser Pro Pro
405 410 415

Pro Pro Gln Ser Ser Gln Glu
420



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 78 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:

Met Leu Arg Leu Asp Ile Ile Asn Ser Leu Val Thr Thr Val Phe Met
1 5 10 15

Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu Thr Thr Thr Leu Thr
20 25 30

Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala
35 40 45

Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro
50 55 60

Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 401:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 74 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:

Met Leu Lys Gln Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser
1 5 10 15

His Cys Trp Gly Leu Pro Leu His Val Ala Pro Leu Cys Arg Gly His
20 25 30

Gln Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gln Gly Ala Trp
35 40 45

Asn Arg Asn Leu Ala Asn Gln Arg His Phe Phe Cys Pro Ser Ile Phe
50 55 60

His Thr Cys Pro Thr Val Leu Phe Phe Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 402:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402:

Ala Arg Thr Ile Leu Val Leu Tyr Leu Ser Leu Gln Arg Leu Glu Asn
1 5 10 15

Leu Ala Tyr His
20



(2) INFORMATION FOR SEQ ID NO: 403:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:

Met Pro Leu Pro Ser Val Pro Ile Leu Gly Ile Phe Ser Phe Leu Ile
1 5 10 15

Pro Ser Ser Gln Gly Val Ser Tyr Thr Lys Leu Pro Ile Ser Ser Pro
20 25 30

Gln Tyr Ser Pro Phe Val Asn Asp His Phe Ser Phe Leu Asn Pro Phe
35 40 45

Pro Val Gln Ile His Thr Gly Phe Ala Arg Val Gly Ser Tyr Met Gln
50 55 60

Met Pro Leu Val His Leu Cys Leu Leu Gln Thr Ser Leu Met Lys Asn
65 70 75 80

Ser Gly Val Gln Gln Gly Ser
85



(2) INFORMATION FOR SEQ ID NO: 404:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 92 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404:

Met Asn Ala Ala Met Val His Ile Asn Arg Ala Leu Lys Leu Ile Ile
1 5 10 15

Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala Val
20 25 30

Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly Ile Thr
35 40 45

Leu Leu Ile Leu Ala Glu Leu Leu Ile Phe Ser Val Pro Ile Val Tyr
50 55 60

Glu Lys Tyr Lys Thr Gln Ile Asp His Tyr Val Gly Ile Ala Arg Asp
65 70 75 80

Gln Thr Lys Ser Ile Val Glu Lys Ile Pro Ser Lys
85 90



(2) INFORMATION FOR SEQ ID NO: 405:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:

Met Ala Cys Ser Cys Leu Met Ile Gln Ser Phe Ser Thr Ser Ala Leu
1 5 10 15

Val Leu Phe Tyr Gly
20



(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 174 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:

Met Glu Glu Gly Gly Asn Leu Gly Gly Leu Ile Lys Met Val His Leu
1 5 10 15

Leu Val Leu Ser Gly Ala Trp Gly Met Gln Met Trp Val Thr Phe Val
20 25 30

Ser Gly Phe Pro Ala Phe Pro Lys Pro Ser Pro Thr Tyr Leu Arg Thr
35 40 45

Ser Ala Glu Gln Thr Leu Pro Leu Leu Leu Pro His Leu His Gly Leu
50 55 60

Cys Leu His Gln Pro Leu His Leu Gly Phe Thr Ala Cys Leu Gly Ser
65 70 75 80

Ala His Ile Leu Gly Gly Gln Pro Ala Leu Pro Ala Val Pro Glu Pro
85 90 95

Tyr Ala Gly His Cys Gln Arg Pro Leu Ala Gly Thr Pro His His Ser
100 105 110

Cys His Val Gly Pro Ala Asn Arg Gly Arg Arg Ser Glu Ala Trp Val
115 120 125

Gly Arg Tyr Gln Ala Ala Asn Arg Phe Pro Ile Leu Asn Ala Xaa Cys
130 135 140

Glu Arg Arg Thr Pro Ser Thr Val Leu Ser Ala Arg Ile Ser Ser Ala
145 150 155 160

Thr Met Gly Cys Pro Leu Phe Ala Ile Trp Ala Ala Ser Xaa
165 170

(2) INFORMATION FOR SEQ ID NO: 407:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 64 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:

Met Ala Phe Ile Leu Leu Phe Tyr Cys Leu Met Thr Phe Leu Ser Leu
1 5 10 15

Glu Gln Asn Ser Ala Thr Val Glu Pro Ser Ser His Glu Ile Leu His
20 25 30

Leu Leu Gln Asn Cys Phe Glu Leu Leu Arg Thr Ser Thr Ser Gln Cys
35 40 45

Thr Glu Gly Ile Pro Cys Gln Arg Tyr Gln Asn Gly Leu His Ile Xaa
50 55 60

(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 280 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:

Met Glu Ala Val Val Asn Leu Tyr Gln Glu Val Met Lys His Ala Asp
1 5 10 15

Pro Arg Ile Gln Gly Tyr Pro Leu Met Gly Ser Pro Leu Leu Met Thr
20 25 30

Ser Ile Leu Leu Thr Tyr Val Tyr Phe Val Leu Ser Leu Gly Pro Arg
35 40 45

Ile Met Ala Asn Arg Lys Pro Phe Gln Leu Arg Gly Phe Met Ile Val
50 55 60

Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr Ile Val Tyr Glu Phe
65 70 75 80

Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp Arg Cys Asp Pro Val
85 90 95

Asp Tyr Ser Asn Ser Pro Glu Ala Leu Arg Met Val Arg Val Ala Trp
100 105 110

Leu Phe Leu Phe Ser Lys Phe Ile Glu Leu Met Asp Thr Val Ile Phe
115 120 125

Ile Leu Arg Lys Lys Asp Gly Gln Val Thr Phe Leu His Val Phe His
130 135 140

His Ser Val Leu Pro Trp Ser Trp Trp Trp Gly Val Lys Ile Ala Pro
145 150 155 160

Gly Gly Met Gly Ser Phe His Ala Met Ile Asn Ser Ser Val His Val
165 170 175

Ile Met Tyr Leu Tyr Tyr Gly Leu Ser Ala Phe Gly Pro Val Ala Gln
180 185 190

Pro Tyr Leu Trp Trp Lys Lys His Met Thr Ala Ile Gln Leu Ile Gln
195 200 205

Phe Val Leu Val Ser Leu His Ile Ser Gln Tyr Tyr Phe Met Ser Ser
210 215 220

Cys Asn Tyr Gln Tyr Pro Val Ile Ile His Leu Ile Trp Met Tyr Gly
225 230 235 240

Thr Ile Phe Phe Met Leu Phe Ser Asn Phe Trp Tyr His Ser Tyr Thr
245 250 255

Lys Gly Lys Arg Leu Pro Arg Ala Leu Gln Gln Asn Gly Ala Pro Gly
260 265 270

Ile Ala Lys Val Lys Ala Asn Xaa
275 280



(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 284 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

Met Xaa Leu Trp Pro Gln Thr Cys Ser Gly Lys Phe Asp Gly Thr Leu
1 5 10 15

Ala Phe Ser Ile His Xaa Leu Ala Val Ile Leu Gly Asp Gln Leu Thr
20 25 30

Ala Ala Asp Leu Val Pro Ile Phe Asn Gly Phe Leu Lys Asp Leu Asp
35 40 45

Glu Val Arg Ile Gly Val Leu Lys His Leu His Asp Phe Leu Lys Leu
50 55 60

Leu His Ile Asp Lys Arg Arg Glu Tyr Leu Tyr Gln Leu Gln Glu Phe
65 70 75 80

Leu Val Thr Asp Asn Ser Arg Asn Trp Arg Phe Arg Ala Glu Leu Ala
85 90 95

Glu Gln Leu Ile Leu Leu Leu Glu Leu Tyr Ser Pro Arg Asp Val Tyr
100 105 110

Asp Tyr Leu Arg Pro Ile Ala Leu Asn Leu Cys Ala Asp Lys Val Ser
115 120 125

Ser Val Arg Trp Ile Ser Tyr Lys Leu Val Ser Glu Met Val Lys Lys
130 135 140

Leu His Ala Ala Thr Pro Pro Thr Phe Gly Val Asp Leu Ile Asn Glu
145 150 155 160

Leu Val Glu Asn Phe Gly Arg Cys Pro Lys Trp Ser Gly Arg Gln Ala
165 170 175

Phe Val Phe Val Cys Gln Thr Val Ile Glu Asp Asp Cys Leu Pro Met
180 185 190

Asp Gln Phe Ala Val His Leu Met Pro His Leu Leu Thr Leu Ala Asn
195 200 205

Asp Arg Val Pro Asn Val Arg Val Leu Leu Ala Lys Thr Leu Arg Gln
210 215 220

Thr Leu Leu Glu Lys Asp Tyr Phe Leu Ala Ser Ala Ser Cys His Gln
225 230 235 240

Glu Ala Val Glu Gln Thr Ile Met Ala Leu Gln Met Asp Arg Asp Ser
245 250 255

Asp Val Lys Tyr Phe Ala Ser Ile His Pro Ala Ser Thr Lys Ile Ser
260 265 270

Glu Asp Ala Met Ser Thr Ala Ser Ser Thr Tyr Xaa
275 280



(2) INFORMATION FOR SEQ ID NO: 410:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 187 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

Met Leu Phe Leu Phe Phe Val Ile Ile Phe Leu Phe Val Phe Leu Ile
1 5 10 15

Leu Ile Ile Gln Phe Ser Lys Pro Leu Thr Asn Pro His Pro Pro Ala
20 25 30

Gly Xaa Ser Asp Arg Arg Arg Arg Tyr Ser Ser Tyr Arg Ser His Asp
35 40 45

His Tyr Gln Arg Gln Arg Val Leu Gln Lys Glu Arg Ala Ile Glu Glu
50 55 60

Arg Arg Val Val Phe Ile Gly Lys Ile Pro Gly Arg Met Thr Arg Ser
65 70 75 80

Glu Leu Lys Gln Arg Phe Ser Val Phe Gly Glu Ile Glu Glu Cys Thr
85 90 95

Ile His Phe Arg Val Gln Gly Asp Asn Tyr Gly Phe Val Thr Tyr Arg
100 105 110

Tyr Ala Glu Glu Ala Phe Ala Ala Ile Glu Ser Gly His Lys Leu Arg
115 120 125



Gln Ala Asp Glu Gln Pro Phe Asp Leu Cys Phe Gly Gly Arg Arg Xaa
130 135 140



Xaa Cys Lys Arg Ser Tyr Ser Asp Leu Asp Ser Asn Arg Glu Asp Phe
145 150 155 160

Asp Pro Ala Pro Val Lys Ser Lys Phe Asp Ser Leu Asp Phe Asp Thr
165 170 175

Leu Leu Lys Gln Ala Gln Lys Asn Leu Arg Arg
180 185



(2) INFORMATION FOR SEQ ID NO: 411:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:

Met Lys Leu Pro Gly Lys Phe Arg Arg Ala His Gln Gly Asn Leu Glu
1 5 10 15

Ser Gln Leu Thr Ser Glu Ser Tyr Tyr Lys Glu Thr Leu Ser Val Pro
20 25 30

Thr Val Glu His Ile Ile Gln Glu Leu Lys Asp Ile Phe Ser Glu Gln
35 40 45

His Leu Lys Ala Leu Lys Cys Leu Ser Leu Val Pro Ser Val Met Gly
50 55 60

Gln Leu Lys Phe Asn Thr Ser Glu Glu His His Ala Asp Met Tyr Arg
65 70 75 80

Ser Asp Leu Pro Asn Pro Asp Thr Leu Ser Ala Glu Leu His Cys Trp
85 90 95

Arg Ile Lys Trp Lys His Arg Gly Lys Asp Ile Glu Leu Pro Ser Thr
100 105 110

Ile Tyr Glu Ala Leu His Leu Pro Asp Ile Lys Phe Phe Pro Asn Val
115 120 125

Tyr Ala Leu Leu Lys Val Leu Cys Ile Leu Pro Val Met Lys Val Glu
130 135 140

Asn Glu Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg
145 150 155 160

Asn Thr Leu Thr Asp Gln Arg Ser Ser Asn Leu Ala Leu Leu Asn Ile
165 170 175

Asn Phe Asp Ile Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr Ile
180 185 190

Lys Leu Tyr Thr Xaa Xaa Ser Xaa Leu Xaa Thr Xaa Xaa Ser Xaa Xaa
195 200 205

Val Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa
210 215 220

Asp Xaa Xaa Xaa Arg Glu Lys Ala Val Arg Cys Met Xaa
225 230 235



(2) INFORMATION FOR SEQ ID NO: 412:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 192 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

Met Lys Pro Met Ala Val Val Ala Ser Thr Val Leu Gly Leu Val Gln
1 5 10 15

Asn Met Arg Ala Phe Gly Gly Ile Leu Val Val Val Tyr Tyr Val Phe
20 25 30

Ala Ile Ile Gly Ile Asn Leu Phe Arg Gly Val Ile Val Ala Leu Pro
35 40 45

Gly Asn Ser Ser Leu Ala Pro Ala Asn Gly Ser Ala Pro Cys Gly Ser
50 55 60

Phe Glu Gln Leu Glu Tyr Trp Ala Asn Asn Phe Asp Asp Phe Ala Ala
65 70 75 80

Ala Leu Val Thr Leu Trp Asn Leu Met Val Val Asn Asn Trp Gln Val
85 90 95

Phe Leu Asp Ala Tyr Arg Arg Tyr Ser Gly Pro Trp Ser Lys Ile Tyr
100 105 110

Phe Val Leu Trp Trp Leu Val Ser Ser Val Ile Trp Val Asn Leu Phe
115 120 125

Leu Ala Leu Ile Leu Glu Asn Phe Leu His Lys Trp Asp Pro Arg Ser
130 135 140

His Leu Gln Pro Leu Ala Gly Thr Pro Glu Ala Thr Tyr Gln Met Thr
145 150 155 160

Val Glu Leu Leu Phe Arg Asp Ile Leu Glu Glu Pro Gly Glu Asp Glu
165 170 175

Leu Thr Glu Arg Leu Ser Gln His Pro His Leu Trp Leu Cys Arg Xaa
180 185 190



(2) INFORMATION FOR SEQ ID NO: 413:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:

Asn Val Val Val Val Ala Phe Gly Leu Ile Leu Ile Ile Glu Ser Leu
1 5 10 15

Gly Glu Gln Cys Pro
20



(2) INFORMATION FOR SEQ ID NO: 414:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 51 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

Met Asn Trp Gly Leu Ser Ile Trp Leu His Tyr Tyr Glu Lys Lys Lys
1 5 10 15

Glu Gln Val Phe Leu Val Ile Leu Ala His Val Val Arg Arg Cys Ala
20 25 30

Ser Asp Gly Ile Leu Gln Phe Glu Ser Ser Leu Leu Lys Met Arg Arg
35 40 45

Ala Pro Xaa
50

(2) INFORMATION FOR SEQ ID NO: 415:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:

Met Leu Ile Ile Ser Leu Arg Pro Gln Phe Pro Ser Leu Ile Val Gln
1 5 10 15

Leu Glu Cys Ser Val Leu Phe Leu Pro Ile Ser Leu Asn Leu Leu Leu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 416:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 163 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

Met Val Lys Val Cys Asn Asp Ser Asp Arg Trp Ser Leu Ile Ser Leu
1 5 10 15

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu
20 25 30

Arg Arg Gln Phe Glu Phe Ser Val Asp Ser Phe Gln Ile Lys Leu Asp
35 40 45

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Thr
50 55 60

Phe His Pro Thr Ile Ile Gly Glu Ser Val Tyr Gly Asp Phe Gln Glu
65 70 75 80

Ala Phe Asp His Leu Cys Asn Lys Ile Ile Ala Thr Arg Asn Pro Glu
85 90 95

Glu Ile Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg
100 105 110

Gly Phe Arg Pro Ala Ser Asp Glu Ile Lys Thr Leu Gln Arg Tyr Met
115 120 125

WO 98/39448 PCT/US98/04493

Cys Ser Arg Phe Phe Ile Asp Phe Ser Asp Ile Gly Glu Gln Gln Arg
130 135 140

Lys Leu Glu Ser Tyr Leu Gln Asn His Phe Val Gly Ile Gly Arg Pro
145 150 155 160

Gln Val Xaa



(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 174 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gln Ile Phe
1 5 10 15

Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg Ile Ile Leu Gly
20 25 30

Ala Asn Ala Ile Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser
35 40 45

Ala Ser Phe Trp Ala Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr
50 55 60

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Ser
65 70 75 80

Glu Asp Gly Ala Leu Met Asp Gly Gly Met Asp Leu Asn Met Glu Gln
85 90 95

Gly Met Ala Glu His Leu Lys Asp Val Ile Leu Leu Thr Ala Ile Val
100 105 110

Gln Val Leu Ser Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu
115 120 125

Ala Pro Gly Arg Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro
130 135 140

Trp Phe Thr Ala Asp Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys
145 150 155 160

Arg Gln Arg Arg Gln Glu Arg Arg Gln Met Lys Arg Leu Xaa
165 170



(2) INFORMATION FOR SEQ ID NO: 418:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:

Met Glu Leu Pro Lys Gly Leu Gln Gly Val Gly Pro Val Ala Met Met
1 5 10 15

Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gln Ala Leu Arg
20 25 30

Gln Ser Gln Gly Lys Ser Pro Leu Leu Trp Lys Arg Thr Cys Cys Leu
35 40 45

Ala Xaa
50



(2) INFORMATION FOR SEQ ID NO: 419:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 120 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

Met Leu Gly Lys Gly Gly Gly Arg Ala Gly Leu Leu Arg Tyr Arg Leu
1 5 10 15

Leu Tyr Phe Thr Leu Val Val Gly Glu Gly Glu Pro Gly Glu Asn Lys
20 25 30

Val Thr Ile Pro Phe Phe Glu Thr Gly Lys Lys Ile Ile Phe Cys Ser
35 40 45

Val Lys Met Val Glu Asn Ser Asn Val Pro Ser His Lys Gly Pro Val
50 55 60

Pro Leu Arg Ser Glu Gln Trp Glu Leu Lys Ile Ser Glu Thr Leu Gly
65 70 75 80

Glu Gly Lys Ile Gly Phe Leu Leu Ile Gly Arg Cys Ser Ser Gly Xaa
85 90 95

Gly Gly Leu Cys Phe Cys Trp Asp Val Leu Cys Cys Met Tyr Ala Tyr
100 105 110

Met Asp Arg Ser Leu Leu Ser Leu
115 120



(2) INFORMATION FOR SEQ ID NO: 420:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 159 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:




Met Thr His Leu Leu Leu Thr Ala Thr Val Thr Pro Ser Glu Gln Asn
1 5 10 15

Ser Ser Arg Glu Pro Gly Trp Glu Thr Ala Met Ala Lys Asp Ile Leu
20 25 30

Gly Glu Ala Gly Leu His Phe Asp Glu Leu Asn Lys Leu Arg Val Leu
35 40 45

Asp Pro Glu Val Thr Gln Gln Thr Ile Glu Leu Lys Glu Glu Cys Lys
50 55 60

Asp Phe Val Asp Lys Ile Gly Gln Phe Gln Lys Ile Val Gly Gly Leu
65 70 75 80

Ile Glu Leu Val Asp Gln Leu Ala Lys Glu Ala Glu Asn Glu Lys Met
85 90 95

Lys Ala Ile Gly Ala Arg Asn Leu Leu Lys Ser Ile Ala Lys Gln Arg
100 105 110

Glu Ala Gln Gln Gln Gln Leu Gln Ala Leu Ile Ala Glu Lys Lys Met
115 120 125

Gln Leu Glu Arg Tyr Arg Val Glu Tyr Glu Ala Leu Cys Lys Val Glu
130 135 140

Ala Glu Gln Asn Glu Phe Ile Asp Gln Phe Ile Phe Gln Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 421:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 154 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421:

Met Asn Val Gly Val Ala His Ser Glu Val Asn Pro Asn Thr Arg Val
1 5 10 15

Met Asn Ser Arg Gly Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu
20 25 30

Leu His Ile Val Leu Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala
35 40 45

Trp Thr Leu Thr Asn Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu
50 55 60

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala
65 70 75 80

Arg Leu Leu Thr His Trp Glu Gln Leu Asp Tyr Gly Val Gln Phe Thr
85 90 95

Ser Ser Arg Lys Phe Phe Thr Ile Ser Pro Ile Ile Leu Tyr Phe Leu
100 105 110

Ala Ser Phe Tyr Thr Lys Tyr Asp Pro Thr His Phe Ile Leu Asn Thr
115 120 125

Ala Ser Leu Leu Ser Val Leu Ile Pro Lys Met Pro Gln Leu His Gly
130 135 140

Val Arg Ile Phe Gly Ile Asn Lys Tyr Xaa
145 150



(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 204 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:

Met Val Cys Gly Gly Phe Ala Cys Ser Lys Asn Cys Leu Cys Ala Leu
1 5 10 15

Asn Leu Leu Tyr Thr Leu Val Ser Leu Leu Leu Ile Gly Ile Ala Ala
20 25 30

Trp Gly Ile Gly Phe Gly Leu Ile Ser Ser Leu Arg Val Val Gly Val
35 40 45

Val Ile Ala Val Gly Ile Phe Leu Phe Leu Ile Ala Leu Val Gly Leu
50 55 60

Ile Gly Ala Val Lys His His Gln Val Leu Leu Phe Phe Tyr Met Ile
65 70 75 80

Ile Leu Leu Leu Val Phe Ile Val Gln Phe Ser Val Ser Cys Ala Cys
85 90 95

Leu Ala Leu Asn Gln Glu Gln Gln Gly Gln Leu Leu Glu Val Gly Trp
100 105 110

Asn Asn Thr Ala Ser Ala Arg Asn Asp Ile Gln Arg Asn Leu Asn Cys
115 120 125

Cys Gly Phe Arg Ser Val Asn Pro Asn Asp Thr Cys Leu Ala Ser Cys
130 135 140

Val Lys Ser Asp His Ser Cys Ser Pro Cys Ala Pro Ile Ile Gly Glu
145 150 155 160

Tyr Ala Gly Glu Val Leu Arg Phe Val Gly Gly Ile Gly Leu Phe Phe
165 170 175

Ser Phe Thr Glu Ile Leu Gly Val Trp Leu Thr Tyr Arg Tyr Arg Asn
180 185 190

Gln Lys Asp Pro Arg Ala Asn Pro Ser Ala Phe Leu
195 200

(2) INFORMATION FOR SEQ ID NO: 423:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 67 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

Met Leu Gln Ser Ile Ile Lys Asn Ile Trp Ile Pro Met Lys Pro Tyr
1 5 10 15

Tyr Thr Lys Val Tyr Gln Glu Ile Trp Ile Gly Met Gly Leu Met Gly
20 25 30

Phe Ile Val Tyr Lys Ile Arg Ala Ala Asp Lys Arg Ser Lys Ala Leu
35 40 45

Lys Ala Ser Ala Pro Ala Pro Gly His His Asn Gln Ile Tyr Leu Glu
50 55 60

Tyr Met Xaa
65

(2) INFORMATION FOR SEQ ID NO: 424:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424:

Met Leu Gly Val Ser Leu Phe Leu Leu Val Val Leu Tyr His Tyr Val
1 5 10 15

Ala Val Asn Asn Pro Lys Lys Gln Glu
20 25



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 299 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:

Met Ala Ala Xaa Glu Pro Ala Val Leu Ala Leu Pro Asn Ser Gly Ala
1 5 10 15

Gly Gly Ala Gly Ala Pro Ser Gly Thr Val Pro Val Leu Phe Cys Phe
20 25 30

Ser Val Phe Ala Arg Pro Ser Ser Val Pro His Gly Ala Gly Tyr Glu
35 40 45

Leu Leu Ile Gln Lys Phe Leu Ser Leu Tyr Gly Asp Gln Ile Asp Met
50 55 60

His Arg Lys Phe Val Val Gln Leu Phe Ala Glu Glu Trp Gly Gln Tyr
65 70 75 80

Val Asp Leu Pro Lys Gly Phe Ala Val Ser Glu Arg Cys Lys Val Arg
85 90 95

Leu Val Pro Leu Gln Ile Gln Leu Thr Thr Leu Gly Asn Leu Thr Pro
100 105 110

Ser Ser Thr Val Phe Phe Cys Cys Asp Met Gln Glu Arg Phe Arg Pro
115 120 125

Ala Ile Lys Tyr Phe Gly Asp Ile Ile Ser Val Gly Gln Arg Leu Leu
130 135 140



Gln Gly Ala Arg Ile Leu Gly Ile Pro Val Ile Val Thr Glu Gln Tyr
145 150 155 160

Pro Lys Gly Leu Gly Ser Thr Val Gln Glu Ile Asp Leu Thr Gly Val
165 170 175

Lys Leu Val Leu Pro Lys Thr Lys Phe Ser Met Val Leu Pro Glu Val
180 185 190

Glu Ala Ala Leu Ala Glu Ile Pro Gly Val Arg Ser Val Val Leu Phe
195 200 205

Gly Val Glu Thr His Val Cys Ile Gln Gln Thr Ala Leu Glu Leu Val
210 215 220

Gly Arg Gly Val Glu Val His Ile Val Ala Asp Ala Thr Ser Ser Arg
225 230 235 240

Ser Met Met Asp Arg Met Phe Ala Leu Glu Arg Leu Ala Xaa Xaa Gly
245 250 255

Ile Ile Val Thr Thr Ser Glu Ala Val Leu Leu Gln Leu Val Ala Asp
260 265 270

Lys Asp His Pro Lys Phe Lys Glu Ile Gln Asn Leu Ile Lys Ala Ser
275 280 285

Ala Pro Glu Ser Gly Leu Leu Ser Lys Val Xaa
290 295



(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

Met Arg Asp Leu Gly Thr Leu Leu Ser Pro Val Cys Ser
1 5 10



(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 198 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

Met Phe Gly Cys Leu Val Ala Gly Arg Leu Val Gln Thr Ala Ala Gln
1 5 10 15

Gln Val Ala Glu Asp Lys Phe Val Phe Asp Leu Pro Asp Tyr Glu Ser
20 25 30

Ile Asn His Val Val Val Phe Met Leu Gly Thr Ile Pro Phe Pro Glu
35 40 45

Gly Met Gly Gly Ser Val Tyr Phe Ser Tyr Pro Asp Ser Asn Gly Met
50 55 60

Pro Val Trp Gln Leu Leu Gly Phe Val Thr Asn Gly Lys Pro Ser Ala
65 70 75 80

Ile Phe Lys Ile Ser Gly Leu Lys Ser Gly Glu Gly Ser Gln His Pro
85 90 95

Phe Gly Ala Met Asn Ile Val Arg Thr Pro Ser Val Ala Gln Ile Gly
100 105 110

Ile Ser Val Glu Leu Leu Asp Ser Met Ala Gln Gln Thr Pro Val Gly
115 120 125

Asn Ala Ala Val Ser Ser Val Asp Ser Phe Thr Gln Phe Thr Gln Lys
130 135 140

Met Leu Asp Asn Phe Tyr Asn Phe Ala Ser Ser Phe Ala Val Ser Gln
145 150 155 160

Ala Gln Met Thr Pro Ser Pro Ser Glu Met Phe Ile Pro Ala Asn Val
165 170 175

Val Leu Lys Trp Tyr Glu Asn Phe Gln Arg Arg Leu Ala Gln Asn Pro
180 185 190

Xaa Phe Trp Xaa Thr Xaa
195



(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:

Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser
1 5 10 15

Ala Gly Val Asn Phe Ile Leu Ala Leu Pro Leu Leu Leu Leu Trp Lys
20 25 30

Asn Arg Gly Gly Val Gly Arg Ser Val Met Ser Ala Val Glu Xaa
35 40 45



(2) INFORMATION FOR SEQ ID NO: 429:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 370 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:

Met Lys Lys Val Glu Glu Lys Arg Val Asp Val Asn Ser Ala Val Ala
1 5 10 15

Met Gly Glu Val Ile Leu Ala Val Cys His Pro Asp Cys Ile Thr Thr
20 25 30

Ile Lys His Trp Ile Thr Ile Ile Arg Ala Arg Phe Glu Glu Val Leu
35 40 45

Thr Trp Ala Lys Gln His Gln Gln Arg Leu Glu Thr Ala Leu Ser Glu
50 55 60

Leu Val Ala Asn Ala Glu Leu Leu Glu Glu Leu Leu Ala Trp Ile Gln
65 70 75 80

Trp Ala Glu Thr Thr Leu Ile Gln Arg Asp Gln Glu Pro Ile Pro Gln
85 90 95

Asn Ile Asp Arg Val Lys Ala Leu Ile Ala Glu His Gln Thr Phe Met
100 105 110

Glu Glu Met Thr Arg Lys Gln Pro Asp Val Asp Arg Val Thr Lys Thr
115 120 125

Tyr Lys Arg Lys Asn Ile Glu Pro Thr His Ala Pro Phe Ile Glu Lys
130 135 140

Ser Arg Ser Gly Gly Arg Lys Ser Leu Ser Gln Pro Thr Pro Pro Pro
145 150 155 160

Met Pro Ile Leu Ser Gln Ser Glu Ala Lys Asn Pro Arg Ile Asn Gln
165 170 175

Leu Ser Ala Arg Trp Gln Gln Val Trp Leu Leu Ala Leu Glu Arg Gln
180 185 190

Arg Lys Leu Asn Asp Ala Leu Asp Arg Leu Glu Glu Leu Lys Glu Phe
195 200 205



Ala Asn Phe Asp Phe Asp Val Trp Arg Lys Lys Tyr Met Arg Trp Met
210 215 220

Asn His Lys Lys Ser Arg Val Met Asp Phe Phe Arg Arg Ile Asp Lys
225 230 235 240

Asp Gln Asp Gly Lys Ile Thr Arg Gln Glu Phe Ile Asp Gly Ile Leu
245 250 255

Ala Ser Lys Phe Pro Thr Thr Lys Leu Glu Met Thr Ala Val Ala Asp
260 265 270

Ile Phe Asp Arg Asp Gly Asp Gly Tyr Ile Asp Tyr Tyr Glu Phe Val
275 280 285

Ala Ala Leu His Pro Asn Lys Asp Ala Tyr Arg Pro Thr Thr Asp Ala
290 295 300

Asp Lys Ile Glu Asp Glu Val Thr Arg Gln Val Ala Gln Cys Lys Cys
305 310 315 320

Ala Lys Arg Phe Gln Val Glu Gln Ile Gly Glu Asn Lys Tyr Arg Phe
325 330 335

Phe Leu Gly Asn Gln Phe Gly Asp Ser Gln Gln Leu Arg Leu Val Arg
340 345 350

Ile Leu Arg Asn Arg Asp Gly Ser Arg Trp Trp Arg Met Asp Gly Leu
355 360 365

Gly Xaa
370



(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

Met Asn Val Lys Thr Phe Ser Xaa Asp His Met His Phe Leu Cys Cys
1 5 10 15

Leu Tyr Leu Arg Tyr Val Thr Phe Val Tyr Leu Asn Leu Phe
20 25 30



(2) INFORMATION FOR SEQ ID NO: 431:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431:

Met Glu Pro His Leu Arg Cys Arg Val Thr Arg Val Arg Gly Ser Leu
1 5 10 15

Gly Asn Thr Gly Arg Trp Leu Leu
20



(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 53 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:

Met His Tyr Leu Val Leu Gly Gly Leu Gly Val Phe Leu Phe Phe Ser
1 5 10 15

Cys Phe Val Phe Leu Phe Phe Xaa Phe Ser Phe Ala Phe Phe Pro Phe
20 25 30

Tyr Leu Glu Gly Met Gly Gly Ser Gly Asn Arg Glu Val Gly Gly Gly
35 40 45

Phe Cys Leu Phe Phe
50



(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 176 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433:

Met Val Ser Lys Ala Leu Leu Arg Leu Val Ser Ala Val Asn Arg Arg
1 5 10 15

Arg Met Lys Leu Leu Leu Gly Ile Ala Leu Leu Ala Tyr Val Ala Ser
20 25 30

Val Trp Gly Asn Phe Val Asn Met Arg Ser Ile Gln Glu Asn Gly Glu
35 40 45

Leu Lys Ile Glu Ser Lys Ile Glu Glu Met Val Glu Pro Leu Arg Glu
50 55 60

Lys Ile Arg Asp Leu Glu Lys Ser Phe Thr Gln Lys Tyr Pro Pro Val
65 70 75 80

Lys Phe Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Ile Thr Gly Gly
85 90 95

Ala Gly Phe Val Gly Ser His Leu Thr Asp Lys Leu Met Met Asp Gly
100 105 110

His Glu Val Thr Val Val Asp Asn Phe Phe Thr Gly Arg Lys Arg Asn
115 120 125

Val Glu His Trp Ile Gly His Glu Asn Phe Glu Leu Ile Asn His Asp
130 135 140

Val Trp Ser Pro Ser Thr Ser Arg Leu Thr Arg Tyr Thr Ile Trp His
145 150 155 160

Leu Gln Pro Pro Leu Gln Thr Thr Cys Ile Ile Leu Ser Arg His Xaa
165 170 175



(2) INFORMATION FOR SEQ ID NO: 434:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 77 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

Met Leu Arg Cys Trp Pro Leu Phe Trp Leu Pro Leu Val Ser Pro Phe
1 5 10 15

Cys Ser Leu Phe Trp Leu Leu Val Glu Trp Phe Gly Thr Asn Ile Asp
20 25 30

Arg Glu Ser Tyr Asp Ala Ile Gly Gly Pro Ser Trp Met Thr Ala Ser
35 40 45

Ser Phe Cys Leu Ser Asn Ser Asn Ile Trp Ser Leu Glu Ile Ser Ser
50 55 60

Gly Ser Thr Ser Val Val His Ser Gln Gln Ala Met Asp
65 70 75



(2) INFORMATION FOR SEQ ID NO: 435:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435:

Met Arg Ser Cys Glu Ile Gln Leu Cys Val Trp Leu Leu Val Ser Ser
1 5 10 15

His Val Asp Met Val Leu Gly Gly Ser Pro Ser Thr Leu Tyr Met Met
20 25 30

(2) INFORMATION FOR SEQ ID NO: 436:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:

Met Val Val Asn Ser Leu Cys Phe Leu Ser Leu Leu Leu Val Ile Leu
1 5 10 15

Glu Leu Ser Thr Asp Ser Ser Ala Arg Leu Leu Tyr His Glu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 437:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 69 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:

Met Asp Lys Gln Lys His Leu Glu Val Arg Arg Ser Val Phe Lys Ile
1 5 10 15

Gln Gly Lys Ile Ala Phe Ser Leu Met Phe Val Leu Lys Asp Leu Ser
20 25 30

Pro Thr Ile Phe Ser His Ser Ile Leu Leu Leu Leu Pro His His Val
35 40 45

Leu Pro Cys Thr Pro Gln Met Val Arg Gly Val Thr Gln Val Leu Arg
50 55 60

Glu Phe Gly Asp Gln
65



(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438:

Met Pro Leu Cys Phe Phe Ser Phe Leu Cys Cys Trp Val Leu Val Phe
1 5 10 15

Lys Leu Ile



(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:

Met Lys Phe Ser Leu Val Leu Leu Ile Lys Ile Ile Ser Phe Glu Arg
1 5 10 15

Leu Leu Ile Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn Ile Trp Arg
20 25 30

Arg Val Met Val Asn Leu Asn Ile Leu Phe Xaa
35 40



(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:

Met Leu Leu Phe Pro Ser Leu Leu Phe Ala Ala Thr Tyr Asn Val Ala
1 5 10 15

Asn Pro Ser Arg Leu Ile Leu Tyr Met Ile Ser Ala Gly Ala Asp Ser
20 25 30

Gln



(2) INFORMATION FOR SEQ ID NO: 441:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 53 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

Met Trp Gln Val Arg Gly Leu Pro Pro Val Pro Leu Leu Leu Thr Met
1 5 10 15

Ser Pro Pro Pro Cys Leu Ser Ser Pro Phe Pro Phe Ile Ser Val Pro
20 25 30

Leu Phe Glu Ala Val Pro Ile Ser Val Ser Asp Gln Pro Ser Pro Xaa
35 40 45

Leu Thr Thr Leu Leu
50



(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 64 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:

Met Ile Thr Ser Val Leu Val Phe Leu Ile Phe Phe Phe Pro Tyr Leu
1 5 10 15

Ser Leu Val Thr Leu Leu Gln Ala Arg Asn Leu Trp Val Ile His Arg
20 25 30

Ala Ala Leu Cys Glu Ser Gly Leu Phe His Trp Arg Lys Gly Ile Glu
35 40 45

Asn Gln Leu Glu Pro Met Tyr Phe Leu Pro His Gly Thr Leu Phe Leu
50 55 60



(2) INFORMATION FOR SEQ ID NO: 443:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:

Met Leu Tyr Ser Cys Glu Pro Tyr Leu Ile Ile Leu Asn Ile Tyr Ser
1 5 10 15

Gln Lys Ala Phe Tyr Phe Tyr Phe Phe Glu Gly Ser Phe Ser Val Cys
20 25 30

Thr Leu



(2) INFORMATION FOR SEQ ID NO: 444:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 89 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:

Met Arg Gln Arg Gln Ala Ala Cys Gln Pro Pro Pro Ser Arg Asn Gly
1 5 10 15

Leu Ala Gln Glu Cys Pro Pro His Ile Pro Ser Ser Phe Phe Leu Val
20 25 30

Lys Leu Leu Phe Ile Pro Trp Leu Ala Ser Leu Leu Ser Ser Pro Leu
35 40 45

Asn Leu Leu Leu Leu Val Ser Ile Ser Trp Asp Leu Gly Leu Lys Leu
50 55 60

Asn Leu Gln Gln Cys Arg Gln His Gln Val Leu Gln Glu Lys Asn Thr
65 70 75 80

Lys Lys Phe Asn Lys Lys Lys Lys Lys
85



(2) INFORMATION FOR SEQ ID NO: 445:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 350 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

Met Asp Phe Ile Thr Ser Thr Ala Ile Leu Pro Leu Leu Phe Gly Cys
1 5 10 15

Leu Gly Val Phe Gly Leu Phe Arg Leu Leu Gln Trp Val Arg Gly Lys
20 25 30

Ala Tyr Leu Arg Asn Ala Val Val Val Ile Thr Gly Ala Thr Ser Gly
35 40 45

Leu Gly Lys Glu Cys Ala Lys Val Phe Tyr Ala Ala Gly Ala Lys Leu
50 55 60

Val Leu Cys Gly Arg Asn Gly Gly Ala Leu Glu Glu Leu Ile Arg Glu
65 70 75 80

Leu Thr Ala Ser His Ala Thr Lys Val Gln Thr His Lys Pro Tyr Leu
85 90 95

Val Thr Phe Asp Leu Thr Asp Ser Gly Ala Ile Val Ala Ala Ala Ala
100 105 110

Glu Ile Leu Gln Cys Phe Gly Tyr Val Asp Ile Leu Val Asn Asn Ala
115 120 125

Gly Ile Ser Tyr Arg Gly Thr Ile Met Asp Thr Thr Val Asp Val Asp
130 135 140

Lys Arg Val Met Glu Thr Asn Tyr Phe Gly Pro Val Ala Leu Thr Lys
145 150 155 160

Ala Leu Leu Pro Ser Met Ile Lys Arg Arg Gln Gly His Ile Val Ala
165 170 175

Ile Ser Ser Ile Gln Gly Lys Met Ser Ile Pro Phe Arg Ser Ala Tyr
180 185 190

Ala Ala Ser Lys His Ala Thr Gln Ala Phe Phe Asp Cys Leu Arg Ala
195 200 205

Glu Met Glu Gln Tyr Glu Ile Glu Val Thr Val Ile Ser Pro Gly Tyr
210 215 220

Ile His Thr Asn Leu Ser Val Asn Ala Ile Thr Ala Asp Gly Ser Arg
225 230 235 240

Tyr Gly Val Met Asp Thr Thr Thr Ala Gln Gly Arg Ser Pro Val Glu
245 250 255

Val Ala Gln Asp Val Leu Ala Ala Val Gly Lys Lys Lys Lys Asp Val
260 265 270

Ile Leu Ala Asp Leu Leu Pro Ser Leu Ala Val Tyr Leu Arg Thr Leu
275 280 285

Ala Pro Gly Leu Phe Phe Ser Leu Met Pro Pro Gly Pro Glu Lys Ser
290 295 300

Gly Asn Pro Arg Thr Pro Ser Thr Leu Thr Ser Gln Gly Gln Gly Arg
305 310 315 320

Glu Ala Ala Leu Leu Gly Leu Leu Thr Leu Gln Gly Thr Val Ala Phe
325 330 335

Val Glu Thr Leu Met Glu Ile Cys Leu Thr Ser Gly Lys Asp
340 345 350



(2) INFORMATION FOR SEQ ID NO: 446:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

Met Val Phe Leu Pro Arg Gly Val Val Val Ser Gly Gly Ala Ala Cys
1 5 10 15

Leu Trp Leu Thr Phe Ile Leu Glu Thr Glu Val Tyr Leu Asp Leu Ala
20 25 30

Thr Glu Ala Arg Ala His Ser Arg Met Gly Leu Gly Leu Trp Pro Pro
35 40 45

Asn



(2) INFORMATION FOR SEQ ID NO: 447:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 278 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:

Met Ala Ser Ala Glu Leu Asp Tyr Thr Ile Glu Ile Pro Asp Gln Pro
1 5 10 15

Cys Trp Ser Gln Lys Asn Ser Pro Ser Pro Gly Gly Lys Glu Ala Glu
20 25 30

Thr Arg Gln Pro Val Val Ile Leu Leu Gly Trp Gly Gly Cys Lys Asp
35 40 45

Lys Asn Leu Ala Lys Tyr Ser Ala Ile Tyr His Lys Arg Gly Cys Ile
50 55 60

Val Ile Arg Tyr Thr Ala Pro Trp His Met Val Phe Phe Ser Glu Ser
65 70 75 80

Leu Gly Ile Pro Ser Leu Arg Val Leu Ala Gln Lys Leu Leu Glu Leu
85 90 95

Leu Phe Asp Tyr Glu Ile Glu Lys Glu Pro Leu Leu Phe His Val Phe
100 105 110

Ser Asn Gly Gly Val Met Leu Tyr Arg Tyr Val Leu Glu Leu Leu Gln
115 120 125

Thr Arg Arg Phe Cys Arg Leu Arg Val Val Gly Thr Ile Phe Asp Ser
130 135 140

Ala Pro Gly Asp Ser Asn Leu Val Gly Ala Leu Arg Ala Leu Ala Ala
145 150 155 160

Ile Leu Glu Arg Arg Ala Ala Met Leu Arg Leu Leu Leu Leu Val Ala
165 170 175

Phe Ala Leu Val Val Val Leu Phe His Val Leu Leu Ala Pro Ile Thr
180 185 190

Ala Xaa Phe His Thr His Phe Tyr Asp Arg Leu Gln Asp Ala Gly Ser
195 200 205

Arg Trp Pro Glu Leu Tyr Leu Tyr Ser Arg Ala Asp Glu Val Val Leu
210 215 220

Ala Arg Asp Ile Glu Arg Met Val Glu Ala Arg Leu Ala Arg Arg Val
225 230 235 240

Leu Ala Arg Ser Val Asp Phe Val Ser Ser Ala His Val Ser His Leu
245 250 255

Arg Asp Tyr Pro Thr Tyr Tyr Thr Ser Leu Cys Val Asp Phe Met Arg
260 265 270

Asn Cys Val Arg Cys Xaa
275



(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 199 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:

Met Ser Phe Ile Phe Asp Trp Ile Tyr Ser Gly Phe Ser Ser Val Leu
1 5 10 15

Gln Phe Leu Gly Leu Tyr Lys Lys Thr Gly Lys Leu Val Phe Leu Gly
20 25 30






Leu Asp Asn Ala Gly Lys Thr Thr Leu Leu His Met Leu Lys Asp Asp
35 40 45

Arg Leu Gly Gln His Val Pro Thr Leu His Pro Thr Ser Glu Glu Leu
50 55 60

Thr Ile Ala Gly Met Thr Phe Thr Thr Phe Asp Leu Gly Gly His Val
65 70 75 80

Gln Ala Arg Arg Val Trp Lys Asn Tyr Leu Pro Ala Ile Asn Gly Ile
85 90 95

Val Phe Leu Val Asp Cys Ala Asp His Glu Arg Leu Leu Glu Ser Lys
100 105 110

Glu Glu Leu Asp Ser Leu Met Thr Asp Glu Thr Ile Ala Asn Val Pro
115 120 125

Ile Leu Ile Leu Gly Asn Lys Ile Asp Arg Pro Glu Ala Ile Ser Glu
130 135 140

Glu Arg Leu Arg Glu Met Phe Gly Leu Tyr Gly Gln Thr Thr Gly Lys
145 150 155 160

Gly Ser Ile Ser Leu Lys Glu Leu Asn Ala Arg Pro Leu Glu Val Phe
165 170 175

Met Cys Ser Val Leu Lys Arg Gln Gly Tyr Gly Glu Gly Phe Arg Trp
180 185 190

Met Ala Gln Tyr Ile Asp Xaa
195

(2) INFORMATION FOR SEQ ID NO: 449:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 258 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:

Met Thr Leu Ser Arg Phe Ala Tyr Asn Gly Lys Arg Cys Pro Ser Ser
1 5 10 15

Tyr Asn Ile Leu Asp Asn Ser Lys Ile Ile Ser Glu Glu Cys Arg Lys
20 25 30

Glu Leu Thr Ala Leu Leu His His Tyr Tyr Pro Ile Glu Ile Asp Pro
35 40 45

His Arg Thr Val Lys Glu Lys Leu Pro His Met Val Glu Trp Trp Thr
50 55 60

Lys Ala His Asn Leu Leu Cys Gln Gln Lys Ile Gln Lys Phe Gln Ile
65 70 75 80

Ala Gln Val Val Arg Glu Ser Asn Ala Met Leu Arg Glu Gly Tyr Lys
85 90 95

Thr Phe Phe Asn Thr Leu Tyr His Asn Asn Ile Pro Leu Phe Ile Phe
100 105 110

Ser Ala Gly Ile Gly Asp Ile Leu Glu Glu Ile Ile Arg Gln Met Lys
115 120 125

Val Phe His Pro Asn Ile His Ile Val Ser Asn Tyr Met Asp Phe Asn
130 135 140

Glu Asp Gly Phe Leu Gln Gly Phe Lys Gly Gln Leu Ile His Thr Tyr
145 150 155 160

Asn Lys Asn Ser Ser Val Cys Glu Asn Xaa Gly Tyr Phe Gln Gln Leu
165 170 175

Glu Gly Lys Thr Asn Val Ile Leu Leu Gly Asp Ser Ile Gly Asp Leu
180 185 190

Thr Met Ala Asp Gly Val Pro Gly Val Gln Asn Ile Leu Lys Ile Gly
195 200 205

Phe Leu Asn Asp Lys Val Glu Glu Arg Arg Xaa Arg Tyr Met Asp Ser
210 215 220

Tyr Asp Ile Val Leu Glu Lys Asp Glu Thr Leu Asp Val Val Asn Gly
225 230 235 240

Leu Leu Gln His Ile Leu Cys Gln Gly Val Gln Leu Glu Met Gln Gly
245 250 255

Pro Xaa



(2) INFORMATION FOR SEQ ID NO: 450:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450:

Met Ser His Val Leu Leu Cys Pro Ser Leu Ser Cys Ser Asn Leu Leu
1 5 10 15

Pro Pro Ser His Ser Leu Gly Thr Met Gly Ser Leu Ser Pro His Leu
20 25 30

Cys Gly His Thr Met Cys Pro Val Asn Pro Glu Leu Pro Leu Ser Ser
35 40 45

Arg Leu Thr Thr Asp Gln Pro Gln Pro Asp Ala Cys Ser Pro Thr Leu
50 55 60

Leu Thr Leu Pro Leu Pro Ser Ser Phe Leu Pro His Ser Lys Pro Thr
65 70 75 80

Phe Xaa His Pro Cys Ser Pro
85



(2) INFORMATION FOR SEQ ID NO: 451:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 315 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451:

Met Phe Ser Ile Asn Pro Leu Glu Asn Leu Lys Val Tyr Ile Ser Ser
1 5 10 15

Arg Pro Pro Leu Val Val Phe Met Ile Ser Val Xaa Pro Met Ala Ile
20 25 30

Ala Phe Leu Thr Leu Gly Tyr Phe Phe Lys Ile Lys Glu Ile Lys Ser
35 40 45

Pro Glu Met Ala Glu Asp Trp Asn Thr Phe Leu Leu Arg Phe Asn Asp
50 55 60

Leu Asp Leu Cys Val Ser Glu Asn Glu Thr Leu Lys His Leu Thr Asn
65 70 75 80

Asp Thr Thr Thr Pro Glu Ser Thr Met Thr Ser Gly Gln Ala Arg Ala
85 90 95

Ser Thr Gln Ser Pro Gln Ala Leu Glu Asp Ser Gly Pro Val Asn Ile
100 105 110

Ser Val Ser Ile Thr Leu Thr Leu Asp Pro Leu Lys Pro Phe Gly Gly
115 120 125

Tyr Ser Arg Asn Val Thr His Leu Tyr Ser Thr Ile Leu Gly His Gln
130 135 140

Ile Gly Leu Ser Gly Arg Glu Ala His Glu Glu Ile Asn Ile Thr Phe
145 150 155 160

Thr Leu Pro Thr Ala Trp Ser Ser Asp Asp Cys Ala Leu His Gly His
165 170 175

Cys Glu Gln Val Val Phe Thr Ala Cys Met Thr Leu Thr Ala Ser Pro
180 185 190

Gly Val Phe Pro Val Thr Val Gln Pro Pro His Cys Val Pro Asp Thr
195 200 205

Tyr Ser Asn Ala Thr Leu Trp Tyr Lys Ile Phe Thr Thr Ala Arg Asp
210 215 220

Ala Asn Thr Lys Tyr Ala Gln Asp Tyr Asn Pro Phe Trp Cys Tyr Lys
225 230 235 240

Gly Ala Ile Gly Lys Val Tyr His Ala Leu Asn Pro Lys Leu Thr Val
245 250 255

Ile Val Pro Asp Asp Asp Arg Ser Leu Ile Asn Leu His Leu Met His
260 265 270

Thr Ser Tyr Phe Leu Phe Val Met Val Ile Thr Met Phe Cys Tyr Ala
275 280 285

Val Ile Lys Gly Arg Pro Ser Lys Leu Arg Gln Ser Asn Pro Glu Phe
290 295 300

Cys Pro Glu Lys Val Ala Leu Ala Glu Ala Xaa
305 310 315



(2) INFORMATION FOR SEQ ID NO: 452:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:

Met Pro Gly Leu Ser Leu Ala Leu Leu Pro Phe Gly Pro Gly Cys Thr
1 5 10 15

Glu Ala Leu His Ala Gly ^s Phe Pro Ala Phe Ala Ser Ala Thr Arg
20 25 30

Val Asn Gly Glu Ala Ala Leu Ser Pro Gly Leu Cys Asp Pro Ile Ser
35 40 45

Val Pro Tyr Val
50



(2) INFORMATION FOR SEQ ID NO: 453:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

Met Ala Val Gly Gln Ile Met Thr Phe Gly Ser Pro Val Ile Gly Cys
1 5 10 15

Gly Phe Ile Ser Gly Trp Asn Leu Val Ser Met Cys Val Glu Tyr Val
20 25 30

Leu Leu Trp Lys Val Tyr Gln Lys Thr Pro Ala Leu Ala Val Lys Ala
35 40 45

Gly Leu Lys Glu Glu Glu Thr Glu Leu Lys Gln Leu Asn Leu His Lys
50 55 60

Asp Thr Glu Pro Lys Pro Leu Glu Gly Thr His Leu Met Gly Val Lys
65 70 75 80

Asp Ser Asn Ile His Glu Leu Glu His Glu Gln Glu Pro Thr Cys Ala
85 90 95

Ser Gln Met Ala Glu Pro Phe Arg Thr Phe Arg Asp Gly Trp Val Ser
100 105 110

Tyr Tyr Asn Gln Pro Val Phe Leu Ala Gly Met Gly Leu Ala Phe Leu
115 120 125

Tyr Met Thr Val Leu Gly Phe Asp Cys Ile Thr Thr Gly Tyr Ala Tyr
130 135 140

Thr Gln Gly Leu Ser Gly Phe His Pro Gln Tyr Phe Asp Gly Ser Ile
145 150 155 160

Ser Tyr Asn Trp Asn Asn Gly Asn Cys Ser Phe Tyr Leu Ala Thr Ser
165 170 175

Lys Met Trp Phe Gly Ser Ala Gly Leu Ile Ser Gly Leu Ala Gln Leu
180 185 190

Ser Cys Leu Ile Leu Cys Val Ile Ser Val Phe Met Pro Gly Ser Pro
195 200 205

Leu Asp Leu Ser Val Ser Pro Phe Glu Asp Ile Arg Ser Arg Phe Ile
210 215 220

Gln Gly Glu Ser Ile Thr Pro Thr Lys Ile Pro Glu Ile Thr Thr Glu
225 230 235 240

Ile Tyr Met Ser Asn Gly Ser Asn Ser Ala Asn Ile Val Pro Glu Thr
245 250 255

Ser Pro Glu Ser Val Pro Ile Ile Ser Val Ser Leu Leu Phe Ala Gly
260 265 270

Val Ile Ala Ala Arg Ile Gly Leu Trp Ser Phe Asp Leu Thr Val Thr
275 280 285

Gln Leu Leu Gln Glu Asn Val Ile Glu Ser Glu Arg Gly Ile Ile Asn
290 295 300

Gly Val Gln Asn Ser Met Asn Tyr Leu Leu Asp Leu Leu His Phe Ile
305 310 315 320

Met Val Ile Leu Ala Pro Asn Pro Glu Ala Phe Gly Leu Leu Val Leu
325 330 335

Ile Ser Val Ser Phe Val Ala Met Gly His Ile Met Tyr Phe Arg Phe
340 345 350

Ala Gln Asn Thr Leu Gly Asn Lys Leu Phe Ala Cys Gly Pro Asp Ala
355 360 365

Lys Glu Val Arg Lys Glu Asn Gln Ala Asn Thr Ser Val Val Xaa
370 375 380



(2) INFORMATION FOR SEQ ID NO: 454:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 186 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454:

Met Arg Ser Ile Gly Asn Lys Asn Thr Ile Leu Leu Gly Leu Gly Phe
1 5 10 15

Gln Ile Leu Gln Leu Ala Trp Tyr Gly Phe Gly Ser Glu Pro Trp Met
20 25 30

Met Trp Ala Ala Gly Ala Val Ala Ala Met Ser Ser Ile Thr Phe Pro
35 40 45

Ala Val Ser Ala Leu Val Ser Arg Thr Ala Asp Ala Asp Gln Gln Gly
50 55 60

Val Val Gln Gly Met Ile Thr Gly Ile Arg Gly Leu Cys Asn Gly Leu
65 70 75 80

Gly Pro Ala Leu Tyr Gly Phe Ile Phe Tyr Ile Phe His Val Glu Leu
85 90 95

Lys Glu Leu Pro Ile Thr Gly Thr Asp Leu Gly Thr Asn Thr Ser Pro
100 105 110

Gln His His Phe Glu Gln Asn Ser Ile Ile Pro Gly Pro Pro Phe Leu
115 120 125

Phe Gly Ala Cys Ser Val Leu Leu Ala Leu Leu Val Ala Leu Phe Ile
130 135 140

Pro Glu His Thr Asn Leu Ser Leu Arg Ser Ser Ser Trp Arg Lys His
145 150 155 160

Cys Gly Ser His Ser His Pro His Asn Thr Gln Ala Pro Gly Glu Ala
165 170 175

Lys Glu Pro Leu Leu Gln Asp Thr Asn Val
180 185



(2) INFORMATION FOR SEQ ID NO: 455:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455:

Met Leu Gln Thr Ser Asn Tyr Ser Leu Val Leu Ser Leu Gln Phe Leu
1 5 10 15

Leu Leu Ser Tyr Asp Leu Phe Val Asn Ser Phe Ser Glu Leu Leu Gln
20 25 30

Lys Thr Pro Val Ile Gln Leu Val Leu Phe Ile Ile Gln Asp Ile Ala
35 40 45

Val Leu Phe Asn Ile Ile Ile Ile Phe Leu Met Phe Phe Asn Thr Phe
50 55 60

Val Phe Gln Ala Gly Leu Val Asn Leu Leu Phe His Lys Phe Lys Gly
65 70 75 80

Thr Ile Ile Leu Thr Ala Val Tyr Phe Ala Leu Ser Ile Ser Leu His
85 90 95

Val Trp Val Met Asn Leu Arg Trp Lys Asn Ser Asn Ser Phe Ile Trp
100 105 110

Thr Asp Gly Leu Gln Met Leu Phe Val Phe Gln Arg Leu Ala Ala Val
115 120 125

Leu Tyr Cys Tyr Phe Tyr Lys Arg Thr Ala Val Arg Leu Gly Asp Pro
130 135 140

His Phe Tyr Gln Asp Ser Leu Trp Leu Arg Lys Glu Phe Met Gln Val
145 150 155 160

Arg Arg Xaa



(2) INFORMATION FOR SEQ ID NO: 456:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:

Met Arg Ile Gln Val Phe Ile Leu Leu Ceir Gly Ala G^
1 5 10 15

Gln Phe Thr Lys Pro Pro Ser Leu Pro Leu Glu Pro Glu Pro Ala Val
20 25 30

Glu Ser Ser Pro Thr Glu Thr Ser Glu Gln Ile Arg Glu Lys
35 40 45



(2) INFORMATION FOR SEQ ID NO: 457:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457:

Met Ser Tyr Leu Ala Phe Leu Tyr Met Thr Phe Asp Phe Cys Cys Leu
1 5 10 15

Tyr Phe Ser Thr Val Tyr Ala Pro Ser Phe Lys Tyr Ile Cys Val His
20 25 30

Thr Asp Thr His Ile Cys Val Cys Val Cys Ile Tyr Leu Ser Ser Val
35 40 45

Val Ser Lys Ser Ser Ala Glu Ala Asp Gly Val Leu Gln Pro Arg Arg
50 55 60

His Pro Ala Ser Leu Leu Ile Val Phe Ala Thr Ser Ile Ser Glu Ser
65 70 75 80

Ser Leu Leu Ile Phe Ser Phe Gln Lys Thr Glu Ala Lys Leu Ile Val
85 90 95

Phe Ala Val Ser Leu Ala Ala Lys Xaa
100 105



(2) INFORMATION FOR SEQ ID NO: 458:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser
1 5 10 15

Leu Leu Pro Val Ile Leu Ala Val Phe Pro Asp Ser Ala Gln Ile Val
20 25 30

Pro Leu Leu Lys Pro Ile Pro Arg Pro Gln Pro Glu Val Ile Phe Pro
35 40 45

Ser Ser Glu Leu Leu Glu Gln Leu Leu Ser Val Gln Phe Val Trp Gln
50 55 60

Ala His Thr Val Ala Xaa
65 70



(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe
1 5 10 15

Ala Leu Val Gly Leu Ala Lys Leu Ser Glu Glu Ile Ser Ala Pro Val
20 25 30

Ser Glu Arg Met Asn Ala Leu Phe Val Gln Phe Ala Glu Val Phe Pro
35 40 45

Leu Lys Val Phe Gly Tyr Gln Pro Asp Pro Leu Asn Tyr Gln Ile Ala
50 55 60

Val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro
65 70 75 80

Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu Leu Met Met
85 90 95

Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys
100 105 110

Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly
115 120 125

Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys
130 135 140

Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:

Met Lys Leu Gly Arg Ala Val Leu Gly Leu Leu Leu Leu Ala Pro Ser
1 5 10 15

Val Val Gln Ala Val Glu Pro Ile Ser Leu Gly Leu Ala Leu Ala Gly
20 25 30

Val Leu Thr Gly Tyr Ile Tyr Pro Arg Leu Tyr Cys Leu Phe Ala Glu
35 40 45

Cys Cys Gly Gln Lys Arg Ser Leu Ser Arg Glu Ala Leu Gln Lys Asp
50 55 60

Leu Asp Asp Asn Leu Phe Gly Gln His Leu Ala Lys Lys Ile Ile Leu
65 70 75 80

Asn Ala Val Phe Gly Phe Ile Asn Asn Pro Lys Pro Lys Lys Pro Leu
85 90 95

Thr Leu Ser Leu His Gly Trp Thr Gly Thr Gly Lys Asn Phe Val Ser
100 105 110

Lys Ile Ile Ala Glu Asn Ile Tyr Glu Gly Gly Leu Asn Ser Asp Tyr
115 120 125

Val His Leu Phe Val Ala Thr Leu His Phe Pro His Ala Ser Asn Ile
130 135 140

Thr Leu Tyr Lys Asp Gln Leu Gln Leu Trp Ile Arg Gly Asn Val Ser
145 150 155 160

Ala Cys Ala Arg Ser Ile Phe Ile Phe Asp Glu Met Asp Lys Met His
165 170 175

Ala Gly Leu Ile Asp Ala Ile Lys Pro Phe Leu Asp Tyr Tyr Asp Leu
180 185 190

Val Asp Gly Val Ser Tyr Gln Lys Ala Met Phe Ile Phe Leu Ser Asn
195 200 205

Ala Gly Ala Glu Arg Ile Thr Asp Val Ala Leu Asp Phe Trp Arg Ser
210 215 220

Gly Lys Gln Arg Glu Asp Ile Lys Leu Lys Asp Ile Glu His Ala Leu
225 230 235 240

Ser Val Ser Val Phe Asn Asn Lys Asn Ser Gly Phe Trp His Ser Ser
245 250 255

Leu Ile Asp Arg Asn Leu Ile Asp Tyr Phe Val Pro Phe Leu Pro Leu
260 265 270

Glu Tyr Lys His Leu Lys Met Cys Ile Arg Val Glu Met Gln Ser Arg
275 280 285

Gly Tyr Glu Ile Asp Glu Asp Ile Val Ser Arg Val Ala Glu Glu Met
290 295 300

Thr Phe Phe Pro Lys Glu Glu Arg Val Phe Ser Asp Lys Gly Cys Lys
305 310 315 320

Thr Val Phe Thr Lys Leu Asp Tyr Tyr Tyr Asp Asp
325 330



(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

Met Leu Lys Cys Ile
1 5



(2) INFORMATION FOR SEQ ID NO: 462:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:

Met Ile Leu Thr Leu Leu Ser Val Val Ser Thr Met Ala Ser
1 5 10
(2) INFORMATION FOR SEQ ID NO: 463:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 285 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:

Met Lys Leu His Pro Pro Pro Pro Ser Pro Val Thr Gln Asp His Arg
1 5 10 15

Ser Lys Ser Ser His Ser Asn Trp Met Pro Arg Met Gly Ala Cys Ser
20 25 30

Met Ser Arg Thr Ser Ser Ser Gly Pro Pro Ser Leu Cys Lys Ser Thr
35 40 45

Ser Gly Arg Ser Cys Thr Arg Pro His Cys Trp Pro Ser Leu Pro Ala
50 55 60

Trp Val Ser Val Phe Thr Arg Thr Asn Thr Gly Ser Trp Cys Tyr Pro
65 70 75 80

Ala Trp Gly Gly Ala Phe Ser Arg Pro Trp Met Ser Ala Gln Ser Met
85 90 95

Cys Cys Ala Glu Arg Ser Val Leu Gln Val Ala Cys Arg Leu Leu Asp
100 105 110

Ala Leu Glu Phe Leu His Glu Asn Glu Tyr Val His Gly Asn Val Thr
115 120 125

Ala Glu Asn Ile Phe Val Asp Pro Glu Asp Gln Ser Gln Val Thr Leu
130 135 140

Ala Gly Tyr Gly Phe Ala Phe Arg Tyr Cys Pro Ser Gly Lys His Val
145 150 155 160

Ala Tyr Val Glu Gly Ser Arg Ser Pro His Glu Gly Asp Leu Glu Phe
165 170 175

Ile Ser Met Asp Leu His Lys Gly Cys Gly Pro Ser Arg Arg Xaa Asp
180 185 190

Leu Gln Ser Leu Gly Tyr Cys Met Leu Lys Trp Leu Tyr Gly Phe Leu
195 200 205

Pro Trp Thr Asn Cys Leu Pro Xaa Xaa Glu Asp Ile Met Lys Gln Lys
210 215 220

Gln Lys Phe Val Asp Lys Pro Gly Pro Phe Val Gly Pro Cys Gly His
225 230 235 240

Trp Ile Arg Pro Ser Glu Thr Leu Gln Lys Tyr Leu Lys Val Val Met
245 250 255

Ala Leu Thr Tyr Glu Glu Lys Pro Pro Tyr Ala Met Leu Arg Asn Asn
260 265 270

Leu Glu Ala Leu Leu Gln Asp Leu Arg Val Ser Pro Tyr
275 280 285



(2) INFORMATION FOR SEQ ID NO: 464:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 80 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:

Met Thr Ser Pro Pro Pro His Gln Gly Trp Glu Gln Arg Gly Cys Gly
1 5 10 15

Glu Ser Gln Val Pro Leu Ala Leu Ser Arg Val Phe Ser Thr Ser His
20 25 30

Tyr Cys Leu Leu Leu Val Ala Asn Gln Ser Ile Phe Phe Pro Cys Leu
35 40 45

Trp Ala Val Glu Arg Leu Leu Gly Val Arg Cys Thr Cys Pro Leu Ser
50 55 60

Trp Gly Lys Arg Ile Ile Ser Glu His Cys Ser Ala Gln Ser Ser Xaa
65 70 75 80



(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:

Met His Thr Trp Tyr Asn Asp Arg Arg Gln Asn Cys His Cys Leu Leu
1 5 10 15

Phe Phe Leu Ile Tyr Leu Arg Lys Ile Tyr Gln Val Val Pro His Val
20 25 30

Pro Leu Leu Val Lys Cys Arg Gly Arg Leu Lys Gly Val Asn Ile
35 40 45



(2) INFORMATION FOR SEQ ID NO: 466:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:

Met Glu Leu Val Leu Val Phe Leu Cys Ser Leu Leu Ala Pro Met Val
1 5 10 15

Leu Ala Ser Ala Ala Glu Lys Glu Lys Glu Met Asp Pro Phe His Tyr
20 25 30

Asp Tyr Gln Thr Leu Arg Ile Gly Gly Leu Val Phe Ala Val Val Leu
35 40 45

Phe Ser Val Gly Ile Leu Leu Ile Leu Ser Arg Arg Cys Lys Cys Ser
50 55 60

Phe Asn Gln Lys Pro Arg Ala Pro Gly Asp Glu Glu Ala Gln Val Glu
65 70 75 80

Asn Leu Ile Thr Ala Asn Ala Thr Glu Pro Gln Lys Ala Glu Asn Xaa
85 90 95



(2) INFORMATION FOR SEQ ID NO: 467:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

Met Ala Ser Gly Ala Asp Ser Lys Gly Asp Asp Leu Ser Thr Ala Ile
1 5 10 15

Leu Lys Gln Lys Asn Arg Pro Asn Arg Leu Ile Val Asp Glu Ala Ile
20 25 30

Asn Glu Asp Asn Ser Val Val Ser Leu Ser Gln Pro Lys Met Asp Glu
35 40 45

Leu Gln Leu Phe Arg Gly Asp Thr Val Leu Leu Lys Gly Lys Lys Arg
50 55 60

Arg Glu Ala Val Cys Ile Val Leu Ser Asp Asp Thr Cys Ser Asp Glu
65 70 75 80

Lys Ile Arg Met Asn Arg Val Val Arg Asn Asn Leu Arg Val Arg Leu
85 90 95

Gly Asp Val Ile Ser Ile Gln Pro Cys Pro Asp Val Lys Tyr Gly Lys
100 105 110

Arg Ile His Val Leu Pro Ile Asp Asp Thr Val Glu Gly Ile Thr Gly
115 120 125

Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr Phe Leu Glu Ala Tyr Arg
130 135 140

Pro Ile Arg Lys Gly Asp Ile Phe Leu Val Arg Gly Gly Met Arg Ala
145 150 155 160

Val Glu Phe Lys Val Val Glu Thr Asp Pro Ser Pro Tyr Cys Ile Val
165 170 175

Ala Pro Asp Thr Val Ile His Cys Glu Gly Glu Pro Ile Lys Arg Glu
180 185 190

Asp Glu Glu Glu Ser Leu Asn Glu Val Gly Tyr Asp Asp Ile Gly Gly
195 200 205

Cys Arg Lys Gln Leu Ala Gln Ile Lys Glu Met Val Glu Leu Pro Leu
210 215 220

Arg His Pro Ala Leu Phe Lys Ala Ile Gly Val Lys Pro Pro Arg Gly
225 230 235 240

Ile Leu Leu Tyr Gly Pro Pro Gly Thr Gly Lys Thr Leu Ile Ala Arg
245 250 255

Ala Val Ala Asn Glu Thr Gly Ala Phe Phe Phe Leu Ile Asn Gly Pro
260 265 270

Glu Ile Met Ser Lys Leu Ala Gly Glu Ser Glu Ser Asn Leu Arg Lys
275 280 285

Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro Ala Ile Ile Phe Ile Asp
290 295 300

Glu Leu Asp Ala Ile Ala Pro Lys Arg Glu Lys Thr His Gly Glu Val
305 310 315 320

Glu Arg Arg Ile Val Ser Gln Leu Leu Thr Leu Met Asp Gly Leu Lys
325 330 335

Gln Arg Ala His Val Ile Val Met Ala Ala Thr Asn Arg Pro Asn Ser
340 345 350

Ile Asp Pro Ala Leu Arg Arg Phe Gly Arg Phe Asp Arg Glu Val Asp
355 360 365

Ile Gly Ile Pro Asp Ala Thr Gly Arg Leu Glu Ile Leu Gln Ile His
370 375 380

Thr Lys Asn Met Lys Leu Ala Asp Asp Val Asp Leu Glu Gln Xaa
385 390 395



(2) INFORMATION FOR SEQ ID NO: 468:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468:

Leu
1



(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:

Met Ala Ala Pro Lys Gly Ser Leu Trp Val Arg Thr Gln Leu Gly Leu
1 5 10 15

Pro Pro Leu Leu Leu Leu Thr Met Ala Leu Ala Gly Gly Ser Gly Thr
20 25 30

Ala Ser Ala Glu Ala Phe Asp Ser Val Leu Gly Asp Thr Ala Ser Cys
35 40 45

His Arg Ala Cys Gln Leu Thr Tyr Pro Leu His Thr Tyr Pro Lys Glu
50 55 60

Glu Glu Leu Tyr Ala Cys Gln Arg Gly Cys Arg Leu Phe Ser Ile Cys
65 70 75 80

Gln Phe Val Asp Asp Gly Ile Asp Leu Asn Arg Thr Lys Leu Glu Cys
85 90 95

Glu Ser Ala Cys Thr Glu Ala Tyr Ser Gln Ser Asp Glu Gln Tyr Ala
100 105 110

Cys His Leu Gly Cys Gln Asn Gln Leu Pro Phe Ala Glu Leu Arg Gln
115 120 125

Glu Gln Leu Met Ser Leu Met Pro Lys Met His Leu Leu Phe Pro Leu
130 135 140

Thr Leu Val Arg Ser Phe Trp Ser Asp Met Met Asp Ser Ala Gln Ser
145 150 155 160

Phe Ile Thr Ser Ser Trp Thr Phe Tyr Leu Gln Ala Asp Asp Gly Lys
165 170 175

Ile Val Ile Phe Xaa Ser Lys Pro Arg Asn Pro Arg Tyr Ala Pro His
180 185 190

Leu Glu Pro Gly Ala Leu Pro Asn Leu Xaa Xaa Xaa Ser Leu Ser Lys
195 200 205

Met Ser Xaa Xaa Ser Xaa Met Arg Asn Ser Gln Ala His Arg Asn Phe
210 215 220

Leu Glu Asp Gly Glu Ser Asp Gly Phe Leu Arg Cys Leu Ser Leu Asn
225 230 235 240

Ser Gly Trp Ile Leu Thr Thr Thr Leu Val Leu Ser Val Met Val Leu
245 250 255

Leu Trp Ile Cys Cys Ala Thr Cys Cys Tyr Thr Leu Leu Asp Ala Val
260 265 270
(2) INFORMATION FOR SEQ ID NO: 470:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 192 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:

Met Met Val Leu Ser Leu Gly Ile Ile Leu Ala Ser Ala Ser Phe Ser
1 5 10 15

Pro Asn Phe Thr Gln Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro
20 25 30

Phe Ile Gly Pro Phe Phe Phe Ile Ile Ser Gly Ser Leu Ser Ile Ala
35 40 45

Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly
50 55 60

Ser Ile Leu Ser Ala Leu Ser Ala Leu Val Gly Phe Ile Ile Leu Ser
65 70 75 80

Val Lys Gln Ala Thr Leu Asn Pro Ala Ser Leu Gln Cys Glu Leu Asp
85 90 95

Lys Asn Asn Ile Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His Asp
100 105 110

Ser Leu Tyr Thr Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly
115 120 125

Xaa Leu Ser Leu Met Leu Ile Cys Thr Leu Leu Glu Phe Cys Leu Ala
130 135 140

Val Leu Thr Ala Val Leu Arg Trp Lys Gln Ala Tyr Ser Asp Phe Pro
145 150 155 160

Gly Ser Val Leu Phe Leu Pro His Ser Tyr Ile Gly Asn Ser Gly Met
165 170 175

Ser Ser Lys Met Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser
180 185 190

(2) INFORMATION FOR SEQ ID NO: 471:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 234 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

Met Arg Lys Thr Arg Leu Trp Gly Leu Leu Trp Met Leu Phe Val Ser
1 5 10 15

Glu Leu Arg Ala Ala Thr Lys Leu Thr Glu Glu Lys Tyr Glu Leu Lys
20 25 30

Glu Gly Gln Thr Leu Asp Val Lys Cys Asp Tyr Thr Leu Glu Lys Phe
35 40 45

Ala Ser Ser Gln Lys Ala Trp Gln Ile Ile Arg Asp Gly Glu Met Pro
50 55 60

Lys Thr Leu Ala Cys Thr Glu Arg Pro Ser Lys Asn Ser His Pro Val
65 70 75 80

Gln Val Gly Arg Ile Ile Leu Glu Asp Tyr His Asp His Gly Leu Leu
85 90 95

Arg Val Arg Met Val Asn Leu Gln Val Glu Asp Ser Gly Leu Tyr Gln
100 105 110

Cys Val Ile Tyr Gln Pro Pro Lys Glu Pro His Met Leu Phe Asp Arg
115 120 125

Ile Arg Leu Val Val Thr Lys Gly Phe Ser Gly Thr Pro Gly Ser Asn
130 135 140

Glu Asn Ser Thr Gln Asn Val Tyr Lys Ile Pro Pro Thr Thr Thr Lys
145 150 155 160

Ala Leu Cys Pro Leu Tyr Thr Ser Pro Arg Thr Val Thr Gln Ala Pro
165 170 175

Pro Lys Ser Thr Ala Asp Val Ser Thr Pro Asp Ser Glu Ile Asn Leu
180 185 190

Thr Asn Val Thr Asp Ile Ile Arg Val Pro Val Phe Asn Ile Val Ile
195 200 205

Leu Leu Ala Gly Gly Phe Leu Ser Lys Ser Leu Val Phe Ser Val Leu
210 215 220

Phe Ala Val Thr Leu Arg Ser Phe Val Pro
225 230

(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

Met Leu His Ile Leu Pro Leu Lys Ser Tyr Asp Phe Pro His Phe Ser
1 5 10 15

Leu Met Gly Arg Tyr Arg Cys Ala Ser Leu Leu Phe Cys Phe Leu Leu
20 25 30

Leu Phe Phe Phe Phe Cys Ser Val Leu Trp Thr Phe Ser Asp Met His
35 40 45

Arg Ser Gly Glu Asp Gly Pro Trp Thr Pro Cys Val His His Leu Ala
50 55 60

Ala Ser Leu Ile Ser Tyr Gly Gln Pro Gly Phe Ile Cys Ile Ser Leu
65 70 75 80

Phe Ser Pro Val Leu Phe Ile Glu Asn Pro Arg His Tyr Ala Asn Ala
85 90 95

Thr Val Thr Thr Leu Gly Asp Trp Xaa
100 105

(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473:

Met Val Phe Leu Lys Tyr Arg Phe Leu Phe Phe Leu Val Phe Leu Ala
1 5 10 15

Asn Cys Ile Tyr Ser Leu His Tyr Lys Pro Ser Leu Met Tyr Pro Lys
20 25 30

(2) INFORMATION FOR SEQ ID NO: 474:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 571 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474:

Met Ala Leu Ser Arg Gly Leu Pro Arg Glu Leu Ala Glu Ala Val Ala
1 5 10 15

Gly Gly Arg Val Leu Val Val Gly Ala Gly Gly Ile Gly Cys Glu Leu
20 25 30

Leu Lys Asn Leu Val Leu Thr Gly Phe Ser His Ile Asp Leu Ile Asp
35 40 45

Leu Asp Thr Ile Asp Val Ser Asn Leu Asn Arg Gln Phe Leu Phe Gln
50 55 60

Lys Lys His Val Gly Arg Ser Lys Ala Gln Val Ala Lys Glu Ser Val
65 70 75 80

Leu Gln Phe Tyr Pro Lys Ala Asn Ile Val Ala Tyr His Asp Ser Ile
85 90 95

Met Asn Pro Asp Tyr Asn Val Glu Phe Phe Arg Gln Phe Ile Leu Val
100 105 110

Met Asn Ala Leu Asp Asn Arg Ala Ala Arg Asn His Val Asn Arg Met
115 120 125

Cys Leu Ala Ala Asp Val Pro Leu Ile Glu Ser Gly Thr Ala Gly Tyr
130 135 140

Leu Gly Gln Val Thr Thr Ile Lys Lys Gly Val Thr Glu Cys Tyr Glu
145 150 155 160

Cys His Pro Lys Pro Thr Gln Arg Thr Phe Pro Gly Cys Thr Ile Arg
165 170 175

Asn Thr Pro Ser Glu Pro Ile His Cys Ile Val Trp Ala Lys Tyr Leu
180 185 190

Phe Asn Gln Leu Phe Gly Glu Glu Asp Ala Asp Gln Glu Val Ser Pro
195 200 205

Asp Arg Ala Asp Pro Glu Ala Ala Trp Glu Pro Thr Glu Ala Glu Ala
210 215 220

Arg Ala Arg Ala Ser Asn Glu Asp Gly Asp Ile Lys Arg Ile Ser Thr
225 230 235 240

Lys Glu Trp Ala Lys Ser Thr Gly Tyr Asp Pro Val Lys Leu Phe Thr
245 250 255

Lys Leu Phe Lys Asp Asp Ile Arg Tyr Leu Leu Thr Met Asp Lys Leu
260 265 270

Trp Arg Lys Arg Lys Pro Pro Val Pro Leu Asp Trp Ala Glu Val Gln
275 280 285

Ser Gln Gly Glu Glu Thr Asn Ala Ser Asp Gln Gln Asn Glu Pro Gln
290 295 300

Leu Gly Leu Lys Asp Gln Gln Val Leu Asp Val Lys Ser Tyr Ala Arg
305 310 315 320

Leu Phe Ser Lys Ser Ile Glu Thr Leu Arg Val His Leu Ala Glu Lys
325 330 335

Gly Asp Gly Ala Glu Leu Ile Trp Asp Lys Asp Asp Pro Ser Ala Met
340 345 350

Asp Phe Val Thr Ser Ala Ala Asn Leu Arg Met His Ile Phe Ser Met
355 360 365

Asn Met Lys Ser Arg Phe Asp Ile Lys Ser Met Ala Gly Asn Ile Ile
370 375 380

Pro Ala Ile Ala Thr Thr Asn Ala Val Ile Ala Gly Leu Ile Val Leu
385 390 395 400

Glu Gly Leu Lys Ile Leu Ser Gly Lys Ile Asp Gln Cys Arg Thr Ile
405 410 415

Phe Leu Asn Lys Gln Pro Asn Pro Arg Lys Lys Leu Leu Val Pro Cys
420 425 430

Ala Leu Asp Pro Pro Asn Pro Asn Cys Tyr Val Cys Ala Ser Lys Pro
435 440 445

Glu Val Thr Val Arg Leu Asn Val His Lys Val Thr Val Leu Thr Leu
450 455 460

Gln Asp Lys Ile Val Lys Glu Lys Phe Ala Met Val Ala Pro Asp Val
465 470 475 480

Gln Ile Glu Asp Gly Lys Gly Thr Ile Leu Ile Ser Ser Glu Glu Gly
485 490 495

Glu Thr Glu Ala Asn Asn His Lys Lys Leu Ser Glu Phe Gly Ile Arg
500 505 510

Asn Gly Ser Arg Leu Gln Ala Asp Asp Phe Leu Gln Asp Tyr Thr Leu
515 520 525

Leu Ile Asn Ile Leu His Ser Glu Asp Leu Gly Lys Asp Val Glu Phe
530 535 540

Glu Val Val Gly Asp Ala Pro Glu Lys Val Gly Xaa Lys Gln Ala Glu
545 550 555 560

Asp Ala Ala Lys Ser Ile Thr Asn Gly Gln Xaa
565 570



(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

Met Gln Val Val Thr Cys Leu Thr Arg Asp Ser Tyr Leu Thr His Cys
1 5 10 15

Phe Leu Gln His Leu Met Val Val Leu Ser Ser Leu Glu Arg Thr Pro
20 25 30

Ser Pro Glu Pro Val Asp Lys Asp Phe Tyr Ser Glu Phe Gly Asn Lys
35 40 45

Thr Thr Gly Lys Met Glu Asn Tyr Glu Leu Ile His Ser Ser Arg Val
50 55 60

Lys Phe Thr Tyr Pro Ser Glu Glu Glu Ile Gly Asp Leu Thr Phe Thr
65 70 75 80

Val Ala Gln Lys Met Ala Glu Pro Glu Lys Ala Pro Ala Leu Ser Ile
85 90 95

Leu Leu Tyr Val Gln Ala Phe Gln Val Gly Met Pro Pro Pro Gly Cys
100 105 110

Cys Arg Gly Pro Leu Arg Pro Lys Thr Leu Leu Leu Thr Ser Ser Glu
115 120 125

Ile Phe Leu Leu Asp Glu Asp Cys Val His Tyr Pro Leu Pro Glu Phe
130 135 140

Ala Lys Glu Pro Pro Gln Arg Asp Arg Tyr Arg Leu Asp Asp Gly Arg
145 150 155 160

Arg Val Arg Asp Leu Asp Arg Val Leu Met Gly Tyr Gln Thr Tyr Pro
165 170 175

Gln Pro Ser Pro Ser Ser Ser Met Thr Cys Lys Val Met Thr Ser Trp
180 185 190

Ala Val Ser Pro Trp Thr Thr Leu Gly Arg Cys Gln Val Ala Arg Leu
195 200 205

Glu Pro Ala Arg Ala Val Lys Ser Ser Gly Arg Cys Leu Ser Pro Val
210 215 220

Leu Arg Ala Glu Arg Ser Ser Ser Arg Cys Trp Leu Ala Ser Gly Arg
225 230 235 240

Pro Cys Val Ala Val Ser Cys Leu Ser Ser Ser Pro Ala Ser Pro Gly
245 250 255

His Ser Gln Pro Val Val Ser Ser Leu Thr Pro Thr Gly Ala Gly Gln
260 265 270

Gln Ala Phe Val Phe Ser Lys Asn Val Leu Ser Ser Leu Trp Tyr Leu
275 280 285

Asn Leu Thr Val Leu Ala Glu Asn Val Asn Met Cys Val Cys Cys Val
290 295 300

Asn Ser Phe Ser Cys Trp Glu Xaa
305 310



(2) INFORMATION FOR SEQ ID NO: 476: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

Met Ala Gln His His Leu Trp Ile Leu Leu Leu Cys Leu Gln Thr Trp
1 5 10 15

Pro Glu Ala Ala Gly Lys Asp Ser Glu Ile Phe Thr Val Asn Gly Ile
20 25 30

Leu Gly Glu Ser Val Thr Phe Pro Val Asn Ile Gln Glu Pro Arg Gln
35 40 45

Val Lys Ile Ile Ala Trp Thr Ser Lys Thr Ser Val Ala Tyr Val Thr
50 55 60

Pro Gly Asp Ser Glu Thr Ala Pro Val Val Thr Val Thr His Arg Asn
65 70 75 80

Tyr Tyr Glu Arg Ile His Ala Leu Gly Pro Asn Tyr Asn Leu Val Ile
85 90 95

Ser Asp Leu Arg Met Glu Asp Ala Gly Asp Tyr Lys Ala Asp Ile Asn
100 105 110

Thr Gln Ala Asp Pro Tyr Thr Thr Thr Lys Arg Tyr Asn Leu Gln Ile
115 120 125

Tyr Arg Arg Leu Gly Lys Pro Lys Ile Thr Gln Ser Leu Met Ala Ser
130 135 140

Val Asn Ser Thr Cys Asn Val Thr Leu Thr Cys Ser Val Glu Lys Glu
145 150 155 160

Glu Lys Asn Val Thr Tyr Asn Trp Ser Pro Leu Gly Glu Glu Gly Asn
165 170 175

Val Leu Gln Ile Phe Gln Thr Pro Glu Asp Gln Glu Leu Thr Tyr Thr
180 185 190

Cys Thr Ala Gln Asn Pro Val Ser Asn Asn Ser Asp Ser Ile Ser Ala
195 200 205

Arg Gln Leu Cys Ala Asp Ile Ala Met Gly Phe Arg Thr His His Thr
210 215 220

Gly Leu Leu Ser Val Leu Ala Met Phe Phe Leu Leu Val Leu Ile Leu
225 230 235 240

Ser Ser Val Phe Leu Phe Arg Leu Phe Lys Arg Arg Gln Asp Ala Ala
245 250 255

Ser Lys Lys Thr Ile Tyr Thr Tyr Ile Met Ala Ser Arg Asn Thr Gln
260 265 270

Pro Ala Glu Ser Arg Ile Tyr Asp Glu Ile Leu Gln Ser Lys Val Leu
275 280 285

Pro Ser Lys Glu Glu Pro Val Asn Thr Val Tyr Ser Glu Val Gln Phe
290 295 300

Ala Asp Lys Met Gly Lys Ala Ser Thr Gln Asp Ser Lys Pro Pro Gly
305 310 315 320

Thr Ser Ser Tyr Glu Ile Val Ile Xaa
325



(2) INFORMATION FOR SEQ ID NO: 477:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 178 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:

Met Lys Leu Gln Cys Val Ser Leu Trp Leu Leu Gly Thr Ile Leu Ile
1 5 10 15

Leu Cys Ser Val Asp Asn His Gly Leu Arg Arg Cys Leu Ile Ser Thr
20 25 30

Asp Met His His Ile Glu Glu Ser Phe Gln Glu Ile Lys Arg Ala Ile
35 40 45

Gln Ala Lys Asp Thr Phe Pro Asn Val Thr Ile Leu Ser Thr Leu Glu
50 55 60

Thr Leu Gln Ile Ile Lys Pro Leu Asp Val Cys Cys Val Thr Lys Asn
65 70 75 80

Leu Leu Ala Phe Tyr Val Asp Arg Val Phe Lys Asp His Gln Glu Pro
85 90 95

Asn Pro Lys Ile Leu Arg Lys Ile Ser Ser Ile Ala Asn Ser Phe Leu
100 105 110

Tyr Met Gln Lys Thr Leu Arg Gln Cys Gln Glu Gln Arg Gln Cys His
115 120 125

Cys Arg Gln Glu Ala Thr Asn Ala Thr Arg Val Ile His Asp Asn Tyr
130 135 140

Asp Gln Leu Glu Val His Ala Ala Ala Ile Lys Ser Leu Gly Glu Leu
145 150 155 160

Asp Val Phe Leu Ala Trp Ile Asn Lys Asn His Glu Val Met Ser Ser
165 170 175

Ala Xaa



(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 52 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478:

Asp Thr Ala Ile Arg Val Ala Leu Ala Val Ala Val Leu Lys Thr Val
1 5 10 15

Ile Leu Gly Leu Leu Cys Leu Leu Leu Cys Gly Gly Gly Glu Gly Lys
20 25 30

Val Ala Gly Arg Gln Ala Val Thr Ser Asp Gln Gln Ser Val Gly Arg
35 40 45

Arg Asp Val Tyr
50



(2) INFORMATION FOR SEQ ID NO: 479:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 62 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:

Met Gln Lys Lys Asn Ser Leu Phe Phe Phe Phe Ala Phe Tyr Tyr Glu
1 5 10 15

Asn Lys Thr Asn Ala Pro Gly Glu Gly Ser Met Ile Thr Arg Asn Ile
20 25 30

Lys Glu Tyr Phe Leu Pro Phe Leu Phe Cys Cys Val Glu Ala Ser Ile
35 40 45

Ala Ile Asn Lys Leu Asn Tyr Leu His Trp Thr His Phe Gln
50 55 60



(2) INFORMATION FOR SEQ ID NO: 480: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:

Met Pro Gly Leu Ser Leu Ile Leu Thr Val Thr Leu Leu Ala Val Ser
1 5 10 15

Asp Ser Ala Ala Thr Cys Ile Val Ala Lys Gly
20 25



(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 339 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:

Met Ser Gly Pro Asp Val Glu Thr Pro Ser Ala Ile Gln Ile Cys Arg
1 5 10 15

Ile Met Arg Pro Asp Asp Ala Asn Val Ala Gly Asn Val His Gly Gly
20 25 30

Thr Ile Leu Lys Met Ile Glu Glu Ala Gly Ala Ile Ile Ser Thr Arg
35 40 45

His Cys Asn Ser Gln Asn Gly Glu Arg Cys Val Ala Ala Leu Ala Arg
50 55 60

Val Glu Arg Thr Asp Phe Leu Ser Pro Met Cys Ile Gly Glu Val Ala
65 70 75 80

His Val Ser Ala Glu Ile Thr Tyr Thr Ser Lys His Ser Val Glu Val
85 90 95

Gln Val Asn Val Met Ser Glu Asn Ile Leu Thr Gly Ala Lys Lys Leu
100 105 110

Thr Asn Lys Ala Thr Leu Trp Tyr Val Pro Leu Ser Leu Lys Asn Val
115 120 125

Asp Lys Val Leu Glu Val Pro Pro Val Val Tyr Ser Arg Xaa Glu Gln
130 135 140

Glu Glu Glu Gly Arg Lys Arg Tyr Glu Ala Gln Lys Leu Glu Arg Met
145 150 155 160

Glu Thr Lys Trp Arg Asn Gly Asp Ile Val Gln Pro Val Leu Asn Pro
165 170 175

Glu Pro Asn Thr Val Ser Tyr Ser Gln Ser Ser Leu Ile His Leu Val
180 185 190

Gly Pro Ser Asp Cys Thr Leu His Gly Phe Val His Gly Gly Val Thr
195 200 205

Met Lys Leu Met Asp Glu Val Ala Gly Ile Val Ala Ala Arg His Cys
210 215 220

Lys Thr Asn Ile Val Thr Ala Ser Val Asp Ala Ile Asn Phe His Asp
225 230 235 240

Lys Ile Arg Lys Gly Cys Val Ile Thr Ile Ser Gly Arg Met Thr Phe
245 250 255

Thr Ser Asn Lys Ser Met Glu Ile Glu Val Leu Val Asp Ala Asp Pro
260 265 270

Val Val Asp Ser Ser Gln Lys Arg Tyr Arg Ala Ala Ser Ala Phe Phe
275 280 285

Thr Tyr Val Ser Leu Ser Gln Glu Gly Arg Ser Leu Pro Val Pro Gln
290 295 300

Leu Val Pro Glu Thr Glu Asp Glu Lys Lys Arg Phe Glu Glu Gly Lys
305 310 315 320

Gly Arg Tyr Leu Gln Met Lys Ala Lys Xaa Gln Gly His Ala Xaa Xaa
325 330 335

Gln Pro Xaa


(2) INFORMATION FOR SEQ ID NO: 482:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482:

Met Leu Asn Ser Asn Ile Asn Asp Leu Leu Met Val Thr Tyr Leu Ala
1 5 10 15

Asn Leu Thr Gln Ser Gln Ile Ala Leu Asn Glu Lys Leu Val Asn Leu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:

Met Arg Glu Thr Ser Ile Arg Val Leu Leu Met Leu Pro Ala Leu Glu
1 5 10 15

Ser Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr Arg Ile Gly
20 25 30

Cys Phe Lys Thr Ile Thr Cys Trp Pro Thr Ser Leu Thr Gln Arg Xaa
35 40 45



(2) INFORMATION FOR SEQ ID NO: 484:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484:

Met Tyr Met Tyr Ser Leu Asn Val Phe Leu Ser Phe Ile Phe Leu Ala
1 5 10 15

Leu Val Phe Lys Cys Val His Val Cys Gln Gly Ala Asn Ala Phe Leu
20 25 30

Phe Leu Lys Leu Val Phe
35


(2) INFORMATION FOR SEQ ID NO: 485:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 61 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485:

Met Gly Leu Arg Leu Ile Cys Leu Glu Leu Thr Met Val Lys Ala Leu
1 5 10 15

Val Cys Glu Met Phe Leu Phe Phe Leu Met Thr Gln Lys Leu Ile Trp
20 25 30

Gln Glu Cys Thr Glu Lys Phe Ala Lys Leu Leu Val Gln Leu Ile Ser
35 40 45

Leu Val Phe Ala Trp Glu Phe Phe Ser Glu Asp Thr Pro
50 55 60



(2) INFORMATION FOR SEQ ID NO: 486: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:

Met Leu Ala Ala Arg Leu Val Cys Leu Arg Thr Leu Pro Ser Arg Val
1 5 10 15

Phe His Pro Ala Phe Thr Lys Ala Ser Pro Val Val Lys Asn Ser Ile
20 25 30

Thr Lys Asn Gln Trp Leu Leu Thr Pro Ser Arg Glu Tyr Ala Thr Lys
35 40 45

Thr Arg Ile Gly Ile Arg Arg Gly Arg Thr Gly Gln Glu Leu Lys Glu
50 55 60

Ala Ala Leu Glu Pro Ser Met Glu Lys Ile Phe Lys Ile Asp Gln Met
65 70 75 80

Gly Arg Trp Phe Val Ala Gly Gly Ala Ala Val Gly Leu Gly Ala Leu
85 90 95

Cys Tyr Tyr Gly Leu Gly Leu Ser Asn Glu Ile Gly Ala Ile Glu Lys
100 105 110

Ala Val Ile Trp Pro Gln Tyr Val Lys Asp Arg Ile His Ser Thr Tyr
115 120 125

Met Tyr Leu Ala Gly Ser Ile Gly Leu Thr Ala Leu Ser Ala Ile Ala
130 135 140

Ile Ser Arg Thr Pro Val Leu Met Asn Phe Met Met Arg Gly Ser Trp
145 150 155 160

Val Thr Ile Gly Val Thr Phe Ala Ala Met Val Gly Ala Gly Met Leu
165 170 175

Val Arg Ser Ile Pro Tyr Asp Gln Ser Pro Gly Pro Lys His Leu Ala
180 185 190

Trp Leu Leu His Ser Gly Val Met Gly Ala Val Val Ala Pro Leu Thr
195 200 205

Ile Leu Gly Gly Pro Leu Leu Ile Arg Ala Ala Trp Tyr Thr Ala Gly
210 215 220

Ile Val Gly Gly Leu Ser Thr Val Ala Met Cys Ala Pro Ser Glu Lys
225 230 235 240

Phe Leu Asn Met Gly Ala Pro Leu Gly Val Gly Leu Gly Leu Val Phe
245 250 255

Val Ser Ser Leu Gly Ser Met Phe Leu Pro Pro Thr Thr Val Ala Gly
260 265 270

Ala Thr Leu Tyr Ser Val Ala Met Tyr Gly Gly Leu Val Leu Phe Ser
275 280 285

Met Phe Leu Leu Tyr Asp Thr Gln Lys Val Ile Lys Arg Ala Glu Val
290 295 300

Ser Pro Met Tyr Gly Val Gln Lys Tyr Asp Pro Ile Asn Ser Met Leu
305 310 315 320

Ser Ile Tyr Met Asp Thr Leu Asn Ile Phe Met Arg Val Ala Thr Met
325 330 335

Leu Ala Thr Gly Gly Asn Arg Lys Lys Xaa
340 345



(2) INFORMATION FOR SEQ ID NO: 487:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 237 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487:

Met Glu Glu Val Leu Leu Leu Gly Leu Lys Asp Arg Glu Gly Tyr Thr
1 5 10 15

Ser Phe Trp Asn Asp Cys Ile Ser Ser Gly Leu Arg Gly Cys Met Leu
20 25 30

Ile Glu Leu Ala Leu Arg Gly Arg Leu Gln Leu Glu Ala Cys Gly Met
35 40 45

Arg Arg Lys Ser Leu Leu Thr Arg Lys Val Ile Cys Lys Ser Asp Ala
50 55 60

Pro Thr Gly Asp Val Leu Leu Asp Glu Ala Leu Lys His Val Lys Glu
65 70 75 80

Thr Gln Pro Pro Glu Thr Val Gln Asn Trp Ile Glu Leu Leu Ser Gly
85 90 95

Glu Thr Trp Asn Pro Leu Lys Leu His Tyr Gln Leu Arg Asn Val Arg
100 105 110

Glu Arg Leu Ala Lys Asn Leu Val Glu Lys Gly Val Leu Thr Thr Glu
115 120 125

Lys Gln Asn Phe Leu Leu Phe Asp Met Thr Thr His Pro Leu Thr Asn
130 135 140

Asn Asn Ile Lys Gln Arg Leu Ile Lys Lys Val Gln Glu Ala Val Leu
145 150 155 160

Asp Lys Trp Val Asn Asp Pro His Arg Met Asp Arg Arg Leu Leu Ala
165 170 175

Leu Ile Tyr Leu Ala His Ala Ser Asp Val Leu Glu Asn Ala Phe Ala
180 185 190

Pro Leu Leu Asp Glu Gln Tyr Asp Leu Ala Thr Lys Arg Val Arg Gln
195 200 205

Leu Leu Asp Leu Asp Pro Glu Val Glu Cys Leu Lys Ala Asn Thr Asn
210 215 220

Glu Val Leu Trp Ala Val Val Ala Ala Phe Thr Lys Xaa
225 230 235



(2) INFORMATION FOR SEQ ID NO: 488: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 200 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488:

Met Ala Gln Arg Met Val Trp Val Asp Leu Glu Met Thr Gly Leu Asp
1 5 10 15

Ile Glu Lys Asp Gln Ile Ile Glu Met Ala Cys Leu Ile Thr Asp Ser
20 25 30

Asp Leu Asn Ile Leu Ala Glu Gly Pro Asn Leu Ile Ile Lys Gln Pro
35 40 45

Asp Glu Leu Leu Asp Ser Met Ser Asp Trp Cys Lys Glu His His Gly
50 55 60

Lys Ser Gly Leu Thr Lys Ala Val Lys Glu Ser Thr Ile Thr Leu Gln
65 70 75 80

Gln Ala Glu Tyr Glu Phe Leu Ser Phe Val Arg Gln Gln Thr Pro Pro
85 90 95

WO 98/39448 PCT/US98/04493

Gly Leu Cys Pro Leu Ala Gly Asn Ser Val His Glu Asp Lys Lys Phe
100 105 110

Leu Asp Lys Tyr Met Pro Gln Phe Met Lys His Leu His Tyr Arg Ile
115 120 125

Ile Asp Val Ser Thr Val Lys Glu Leu Cys Arg Arg Trp Tyr Pro Glu
130 135 140

Glu Tyr Glu Phe Ala Pro Lys Lys Ala Ala Ser His Arg Ala Leu Asp
145 150 155 160

Asp Ile Ser Glu Ser Ile Lys Glu Leu Gln Phe Tyr Arg Asn Asn Ile
165 170 175

Phe Lys Lys Lys Ile Asp Glu Lys Lys Arg Lys Ile Ile Glu Asn Gly
180 185 190

Glu Asn Glu Lys Thr Val Ser Xaa
195 200

(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:

Met Ala Thr Thr Ala Ala Pro Ala Gly Gly Ala Arg Asn Gly Ala Gly
1 5 10 15

Pro Glu Trp Gly Gly Phe Glu Glu Asn Ile Gln Gly Gly Gly Ser Ala
20 25 30

Val Ile Asp Met Glu Asn Met Asp Asp Thr Ser Gly Ser Ser Phe Glu
35 40 45

Asp Met Gly Glu Leu His Gln Arg Leu Arg Glu Glu Glu Val Asp Ala
50 55 60

Asp Ala Ala Asp Ala Ala Ala Ala Glu Glu Glu Asp Gly Glu Phe Leu
65 70 75 80

Gly Met Lys Gly Phe Lys Gly Gln Leu Ser Arg Gln Val Ala Asp Gln
85 90 95

Met Trp Gln Ala Gly Lys Arg Gln Ala Ser Arg Ala Phe Ser Leu Tyr
100 105 110

Ala Asn Ile Asp Ile Leu Arg Pro Tyr Phe Asp Val Glu Pro Ala Gln
115 120 125

Val Arg Thr Gly Leu Leu Glu Ser Met Ile Pro Ile Lys Met Val Asn
130 135 140

Phe Pro Gln Lys Ile Ala Gly Glu Leu Tyr Gly Pro Leu Met Leu Val
145 150 155 160

Phe Thr Leu Val Ala Ile Leu Leu His Gly Met Lys Thr Ser Asp Thr
165 170 175

Ile Ile Arg Glu Gly Thr Leu Met Gly Thr Ala Ile Gly Thr Cys Phe
180 185 190

Gly Tyr Trp Leu Gly Val Ser Ser Phe Ile Tyr Phe Leu Ala Tyr Leu
195 200 205

Cys Asn Ala Gln Ile Thr Met Leu Gln Met Leu Ala Leu Leu Gly Tyr
210 215 220

Gly Leu Phe Gly His Cys Ile Val Leu Phe Ile Thr Tyr Asn Ile His
225 230 235 240

Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val Gly Gly Leu Ser
245 250 255

Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr Val Gly Pro Thr
260 265 270

Gln Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu His Met Leu Phe
275 280 285

Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Val Glu Gly Ile Leu
290 295 300

Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro Ile Gln Arg Val Pro Arg
305 310 315 320

Asp Ile Pro Ala Met Leu Pro Ala Ala Arg Leu Pro Thr Thr Val Leu
325 330 335

Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gln Ser His Xaa
340 345 350



(2) INFORMATION FOR SEQ ID NO: 490: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 265 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:

Met Arg Gly Ser Arg Gly Gly Trp Ala Gly Glu Met Ala Ala Ser Gly
1 5 10 15

Glu Ser Gly Thr Ser Gly Gly Gly Gly Ser Thr Glu Glu Ala Phe Met
20 25 30

Thr Phe Tyr Ser Glu Val Lys Gln Ile Glu Lys Arg Asp Ser Val Leu
35 40 45

Thr Ser Lys Asn Gln Ile Glu Arg Leu Thr Arg Pro Gly Ser Ser Tyr
50 55 60

Phe Asn Leu Asn Pro Phe Glu Val Leu Gln Ile Asp Pro Glu Val Thr
65 70 75 80

Asp Glu Glu Ile Lys Lys Arg Phe Arg Gln Leu Ser Ile Leu Val His
85 90 95

Pro Asp Lys Asn Gln Asp Asp Ala Asp Arg Ala Gln Lys Ala Phe Glu
100 105 110

Ala Val Asp Lys Ala Tyr Lys Leu Leu Leu Asp Gln Glu Gln Lys Lys
115 120 125

Arg Ala Leu Asp Val Ile Gln Ala Gly Lys Glu Tyr Val Glu His Thr
130 135 140

Val Lys Glu Arg Lys Lys Gln Leu Lys Lys Glu Gly Lys Pro Thr Ile
145 150 155 160

Val Glu Glu Asp Asp Pro Glu Leu Phe Lys Gln Ala Val Tyr Lys Gln
165 170 175

Thr Met Lys Leu Phe Ala Glu Leu Glu Ile Lys Arg Lys Glu Arg Glu
180 185 190

Ala Lys Glu Met His Glu Arg Lys Arg Gln Arg Glu Glu Glu Ile Glu
195 200 205

Ala Gln Glu Lys Ala Lys Arg Glu Arg Glu Trp Gln Lys Asn Phe Glu
210 215 220

Glu Ser Arg Asp Gly Arg Val Asp Ser Trp Arg Asn Phe Gln Ala Asn
225 230 235 240

Thr Lys Gly Lys Lys Glu Lys Lys Asn Arg Thr Phe Leu Arg Pro Pro
245 250 255

Lys Val Lys Met Glu Gln Arg Glu Xaa
260 265



(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491:

Asp Ser Met Pro Thr Cys Pro Leu Xaa Ala Ser Leu Glu Cys Gly Pro
1 5 10 15

Leu Leu Pro Val Arg Leu Cys Cys Leu
20 25



(2) INFORMATION FOR SEQ ID NO: 492:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 159 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492:

Met Asn Glu Tyr Arg Val Pro Glu Leu Asn Val Gln Asn Gly Val Leu
1 5 10 15

Lys Ser Leu Ser Phe Leu Phe Glu Tyr Ile Gly Glu Met Gly Lys Asp
20 25 30

Tyr Ile Tyr Ala Val Thr Pro Leu Leu Glu Asp Ala Leu Met Asp Arg
35 40 45

Asp Leu Val His Arg Gln Thr Ala Ser Ala Val Val Gln His Met Ser
50 55 60

Leu Gly Val Tyr Gly Phe Gly Cys Glu Asp Ser Leu Asn His Leu Leu
65 70 75 80

Asn Tyr Val Trp Pro Asn Val Phe Glu Thr Ser Pro His Val Ile Gln
85 90 95

Ala Val Met Gly Ala Leu Glu Gly Leu Arg Val Ala Ile Gly Pro Cys
100 105 110

Arg Met Leu Gln Tyr Cys Leu Gln Gly Leu Phe His Pro Ala Arg Lys
115 120 125

Val Arg Asp Val Tyr Trp Lys Ile Tyr Asn Ser Ile Tyr Ile Gly Ser
130 135 140

Gln Asp Ala Leu Ile Ala His Tyr Pro Arg Ile Tyr Gln Arg Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 279 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

Met Ile Ser Asp Asn Ser Ala Glu Asn Ile Ala Leu Val Thr Ser Met
1 5 10 15

Tyr Asp Gly Leu Leu Gln Ala Gly Ala Arg Leu Cys Pro Thr Val Gln
20 25 30

Leu Glu Asp Ile Arg Asn Leu Gln Asp Leu Thr Pro Leu Lys Leu Ala
35 40 45

Ala Lys Glu Gly Lys Ile Glu Ile Phe Arg His Ile Leu Gln Arg Glu
50 55 60

Phe Ser Gly Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr
65 70 75 80

Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys
85 90 95

Glu Glu Asn Ser Val Leu Glu Ile Ile Ala Phe His Cys Lys Ser Pro
100 105 110

His Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gln
115 120 125

Ala Lys Trp Asp Leu Leu Ile Pro Lys Phe Phe Leu Asn Phe Leu Cys
130 135 140

Asn Leu Ile Tyr Met Phe Ile Phe Thr Ala Val Ala Tyr His Gln Pro
145 150 155 160

Thr Leu Lys Lys Gln Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn
165 170 175

Ser Met Leu Leu Thr Gly His Ile Leu Ile Leu Leu Gly Gly Ile Tyr
180 185 190

Leu Leu Val Gly Gln Leu Trp Tyr Phe Trp Arg Arg His Val Phe Ile
195 200 205

Trp Ile Ser Phe Ile Asp Ser Tyr Phe Glu Ile Leu Phe Leu Phe Gln
210 215 220

Ala Leu Leu Thr Val Val Ser Gln Val Leu Cys Phe Leu Xaa Ile Glu
225 230 235 240

Trp Tyr Leu Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn
245 250 255

Leu Leu Tyr Tyr Thr Arg Gly Phe Gln His Thr Gly Ile Tyr Ser Val
260 265 270

Met Ile Gln Lys Pro Trp Xaa
275



(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:

Met Ile Arg Cys Gly Leu Ala Cys Glu Arg Cys Arg Trp Ile Leu Pro
1 5 10 15

Leu Leu Leu Leu Ser Ala Ile Ala Phe Asp Ile Ile Ala Leu Ala Gly
20 25 30

Arg Gly Trp Leu Gln Ser Ser Asp His Gly Gln Thr Ser Ser Leu Trp
35 40 45

Trp Lys Cys Ser Gln Glu Gly Gly Gly Ser Gly Ser Tyr Glu Glu Gly
50 55 60

Cys Gln Ser Leu Met Glu Tyr Ala Trp Gly Arg Ala Ala Ala Ala Met
65 70 75 80

Leu Phe Cys Gly Phe Ile Ile Leu Val Ile Cys Phe Ile Leu Ser Phe
85 90 95

Phe Ala Leu Cys Gly Pro Gln Met Leu Val Phe Leu Arg Val Ile Gly
100 105 110

Gly Leu Leu Ala Leu Ala Ala Val Phe Gln Ile Ile Ser Leu Val Ile
115 120 125

Tyr Pro Val Lys Tyr Thr Gln Thr Phe Thr Leu His Ala Asn Xaa Ala
130 135 140

Val Thr Tyr Ile Tyr Asn Trp Ala Tyr Gly Phe Gly Trp Ala Ala Thr
145 150 155 160

Ile Ile Leu Ile Gly Cys Ala Phe Phe Phe Cys Cys Leu Pro Asn Tyr
165 170 175

Glu Asp Asp Leu Leu Gly Asn Ala Lys Pro Arg Tyr Phe Tyr Thr Ser
180 185 190

Ala



(2) INFORMATION FOR SEQ ID NO: 495: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 205 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:

Met Ala Ala Gly Asp Gln Val Phe Ser Gly Ala Gly His Val Xaa Glu
1 5 10 15

His Val Ala Gly Gly Arg His Ala Trp Leu Leu Thr Trp Gln Ser Ala
20 25 30

Cys Pro Ala Asn Arg Leu Ser Leu Val Pro Leu Val Pro Ser Ala Ser
35 40 45

Met Thr Arg Leu Met Arg Xaa Arg Thr Ala Ser Gly Ser Ser Val Ile
50 55 60

Leu Trp Met Ala Pro Ala Ala Ala Pro Thr Pro Ala Arg Ala Pro Glu
65 70 75 80

Ala Ala Pro Thr Pro Ala Arg Ala Pro Ala Ala Ala Arg Thr Pro Ala
85 90 95

Arg Gly Pro Thr Trp Thr Ser Pro Pro Thr Arg Val Leu Leu Gly Thr
100 105 110

Xaa Pro Gly Pro Ser Pro Trp Arg Ser Pro Ala Arg Arg Pro Ala Gln
115 120 125

Leu Pro Pro Pro Asp Ser Asp Leu Cys Ser Gly Pro Leu Leu Pro Gly
130 135 140

Pro Phe Ser Pro Pro Ala Cys His Thr Ala Pro Asn Ser Val Leu Ile
145 150 155 160

Gln Ser Leu Phe Cys Lys Ser Glu Leu Trp Trp Arg Gln Met Arg Ser
165 170 175

Ile Thr Trp Val Pro Ser Pro Lys Ala Gly Trp Arg Trp Thr Lys Gly
180 185 190

Arg Lys Gln Ala Ser Pro His Arg Ile Leu Phe His Xaa
195 200 205



(2) INFORMATION FOR SEQ ID NO: 496:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 147 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:

Met Ala Leu Thr Leu Leu Pro Ser Val Ser Arg Leu Pro Gly Glu Arg
1 5 10 15

Met Ala Ala Ser Gly Leu Pro Tyr Val Leu His His Lys Ser Ser Leu
20 25 30

Met Lys Val Ile Phe Phe Pro Tyr Pro Val Leu Pro Leu Pro Ala Pro
35 40 45

Asn Gly Thr Trp Val Pro Arg Leu Val Leu Gly Leu Gly Ser Gly Asp
50 55 60

Gln Val His Tyr Leu Pro Ile Ser Ser Ser Ile Val Asn Tyr Gly Thr
65 70 75 80

Ser Val Ser Gly Lys Ser Trp Val Phe Leu Val Tyr Pro Leu His Pro
85 90 95

Thr Pro Thr Trp Ser Thr Arg Cys Phe Gln Val Trp Asp Leu Leu Ser
100 105 110

Val Glu Leu Pro Asp Lys Gly Glu Gly Asn Thr Arg Arg Ala Ser Gly
115 120 125

Val Pro Gly Leu Ser Gln Leu Pro Thr Ser His Lys Pro Ile Lys Gln
130 135 140

Glu Tyr Xaa
145


(2) INFORMATION FOR SEQ ID NO: 497:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 64 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val
1 5 10 15

Gly Pro Cys Trp Pro Ser Thr Gly Asn Cys Cys Leu Cys Glu Val Gly
20 25 30

Ala Ala Leu Pro Pro Arg Gly Pro Ser Leu Ser Asp Cys Leu Gly Leu
35 40 45

Pro Pro Trp Thr Pro Trp Gly Pro Ala Trp Thr Leu Ala Gln Ser Xaa
50 55 60



(2) INFORMATION FOR SEQ ID NO: 498: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

Met Ser Thr Gly Ala Leu Asn Thr Ser Pro Pro Ala Ser Asn Arg Leu
1 5 10 15

Glu Ser Thr Leu Asn Glu Tyr Leu Ile Gln Pro Gln Leu His Cys Ser
20 25 30

Ser Val Gln Arg Leu Thr Leu Lys Trp Gly Cys Ser Ser Leu Gln Arg
35 40 45

Asp Gly Gln Ala Val Pro Trp Gly Leu Trp Gln Arg Ala Tyr Pro Ser
50 55 60

Leu Leu Pro Thr Leu Pro Ser Asp Leu Leu Arg Pro His Ala Val Thr
65 70 75 80

Pro Ser Val Ser Val Ser Val His Thr Cys Glu Ser Ser Xaa
85 90



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:

Met Phe Leu Ile Phe Val Tyr Phe Leu Lys Xaa Leu Phe Ser Ser Ser
1 5 10 15

Leu Pro Phe Leu Trp Leu
20



(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

Arg Gly Gly Leu Cys Pro Leu Leu Val Pro Gly Pro Leu Ala Arg Gln
1 5 10 15

Glu Pro Ser Pro Ser Leu Gln Gly Cys Ser Glu Ser Pro Val Gly Met
20 25 30 

Asp 



(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 

Met Gln Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu
1 5 10 15

Cys Asp Val Pro Ile Ser Leu Gly Phe Ser Pro Ser
20 25 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

Pro Gly Lys Pro Gln Ala Cys Pro Glu Leu Thr Ser Val Leu Pro
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 503:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:

Asn Lys Ser Leu Xaa Ser Cys Leu Phe Val Leu His Phe Val Leu His
1 5 10 15

Cys Xaa Phe



(2) INFORMATION FOR SEQ ID NO: 504: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:

Met Glu Lys Thr His Arg Leu Arg Ile Arg Asn Pro Cys Leu Gln Phe
1 5 10 15

Ser Ile Leu Asn Leu Phe Leu Leu Lys Met Ile Val Ser
20 25



(2) INFORMATION FOR SEQ ID NO: 505:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505:

Met Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln
1 5 10 15

Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu
20 25 30

Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly
35 40 45

Pro Ser Ser Phe Gln Asn Pro Ala Ser Ser Pro Ser Ser Trp Thr His
50 55 60

Glu Glu Glu Pro Gly Tyr Phe Pro Gln Tyr Xaa
65 70 75



(2) INFORMATION FOR SEQ ID NO: 506:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506:

Leu Pro Leu Ala Glu Leu Lys Asn Trp Val
1 5 10



(2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 207 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507:

Met Leu Trp Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu
1 5 10 15

Ser Asp Thr Leu Trp Leu Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr
20 25 30

Gly Asp Val Val Pro Gly Thr Met Trp Gly Lys Ile Val Cys Leu Cys
35 40 45

Thr Gly Val Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val
50 55 60

Ala Arg Lys Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe
65 70 75 80

Met Met Asp Ile Gln Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg
85 90 95

Val Leu Gln Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu
100 105 110

Ser His Ala Ala Arg Arg His Gln Arg Xaa Leu Leu Ala Ala Ile Asn
115 120 125

Ala Phe Arg Gln Val Arg Leu Lys His Arg Lys Leu Arg Glu Gln Val
130 135 140

Asn Ser Met Val Asp Ile Ser Lys Met His Met Ile Leu Tyr Asp Leu
145 150 155 160

Gln Gln Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gln Ile Asp
165 170 175

Thr Leu Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala
180 185 190

Leu Gly Pro Arg Gln Leu Pro Glu Pro Ser Gln Gln Ser Lys Xaa
195 200 205



(2) INFORMATION FOR SEQ ID NO: 508: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val
1 5 10 15

Ile Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala
20 25 30

Val Xaa Lys Lys
35

(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509:

Met Ala Leu Val Ala Leu Phe Thr Gln Leu Met Arg Xaa Leu Gly Arg
1 5 10 15

Cys Pro Gln



(2) INFORMATION FOR SEQ ID NO: 510: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510:

Met Thr Phe Pro Phe Glu Lys Glu Asn Ser Cys Phe Gln Cys Leu Leu
1 5 10 15

Phe Asp Ser Trp Arg Glu Gln Thr Arg Thr Asn Ile Gln Pro Gln Arg
20 25 30

(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511:

Met His Leu Leu Asp Phe Phe Arg Asp Leu Val Leu Leu Val Leu Leu
1 5 10 15

Ala Leu Leu Asp Ser Phe Trp Leu Glu Val Gln Lys
20 25



(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512:

Met Cys Leu Ile His Phe Ile Lys Ile Ile Leu Val Phe Ile Leu Lys
1 5 10 15

Leu Trp Leu Tyr Ser Gln Lys Cys Pro Lys
20 25



(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513:

Met Ile His Val His Glu Trp Asn Asp Gln Met Leu Met Val Tyr Ile
1 5 10 15

Phe Leu Tyr Pro Val Ser Ile Thr Phe Leu Asn Leu Cys Ser Leu Thr
20 25 30

Cys



(2) INFORMATION FOR SEQ ID NO: 514: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:

Leu Asn Glu Ser Tyr Val Ser Arg Ala Gly Gly Trp Phe Ser Met Phe
1 5 10 15

Xaa Leu Ile Phe Phe Leu Leu Ala Leu Gly Ser Xaa Leu Cys Leu Leu
20 25 30

Leu Cys Leu Pro Ser Phe Asn Lys Thr Arg Arg Lys Gln Lys Pro
35 40 45


(2) INFORMATION FOR SEQ ID NO: 515:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515:

Ser Ser Lys Thr Pro Leu Pro Ser Glu Arg Arg Trp Ile Ser Gly Ser
1 5 10 15

Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly Ile Ala Leu Leu Gly
20 25 30

Leu Trp Ala Leu Glu Pro Ala Leu Gly His Trp
35 40



(2) INFORMATION FOR SEQ ID NO: 516:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516:

Leu Asn Trp
1



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 174 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517:

Phe Ala Phe Cys Ala Glu Leu Met Ile Gln Asn Trp Thr Leu Gly Ala
1 5 10 15

Val Asp Ser Gln Met Asp Asp Met Asp Met Asp Leu Asp Lys Glu Phe
20 25 30

Leu Gln Asp Leu Lys Glu Leu Lys Val Leu Val Ala Asp Lys Asp Leu
35 40 45

Leu Asp Leu His Lys Ser Leu Val Cys Thr Ala Leu Arg Gly Lys Leu
50 55 60

Gly Val Phe Ser Glu Met Glu Ala Asn Phe Lys Asn Leu Ser Arg Gly
65 70 75 80

Leu Val Asn Val Ala Ala Lys Leu Thr His Asn Lys Asp Val Arg Asp
85 90 95

Leu Phe Val Asp Leu Val Glu Lys Phe Val Glu Pro Cys Arg Ser Asp
100 105 110

His Trp Pro Leu Ser Asp Val Arg Phe Phe Leu Asn Gln Tyr Ser Ala
115 120 125

Ser Val His Ser Leu Asp Gly Phe Arg His Gln Ala Ser Gly Thr Ala
130 135 140

Thr Trp Ala Pro Ser Ala Ala Ala Ser Cys Ala Cys Ile Met Thr Glu
145 150 155 160

Val Pro Pro Asn Ala Pro Pro Thr Leu Thr Ile Lys Leu Leu
165 170

(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 43 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518:

Met Trp Lys Asn Leu Gly Ser Gly Ser Val Phe Val Thr Trp Phe Ser
1 5 10 15

Leu Val Met Ile Leu Ser Gly Ile Gly Pro Leu Gly Asp Ala Glu Asp
20 25 30

Ser Ile Ser Asp Val Ser His Arg Leu Arg Pro
35 40



(2) INFORMATION FOR SEQ ID NO: 519:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:

Phe Gln Phe Pro Leu Leu Thr Ile Ala Leu Gln Phe Leu
1 5 10



(2) INFORMATION FOR SEQ ID NO: 520: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) USNGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 520: 

55 

Met His Tyr Val He Val Leu Ser Leu Phe Val Val Leu Glu Lys Lys 
15 10 15 

Asn Lys Met Gly Ser Asp Gly Cys Leu Arg Lys Asn Gly Ser 
60 20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 521: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:

Met Ser Arg Ser Ile Val Leu Arg Gly Ser Leu Phe Leu Phe Phe Ser
1 5 10 15

His Tyr Thr Leu Lys Leu Leu Ser Val Ile Lys Gln Thr Asn Arg Lys
20 25 30

Ile Val Trp Glu Lys Pro Cys Ile Arg Leu Phe Tyr Xaa Val Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met Pro Leu Pro Val Leu Leu Cys Leu Thr Leu Pro Met Pro Leu Pro
1 5 10 15

Ser Ala Thr Ala Arg Gly Gly Asn Arg Thr
20 25



(2) INFORMATION FOR SEQ ID NO: 523: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523:

Ser Ser Ile Pro Val Ser Ile Leu Ile Gly Met Lys Leu Ile Leu Tyr
1 5 10 15

Leu Leu Ile Thr Glu Ser Gly Ser His Glu Lys Lys Ser Phe Tyr Pro
20 25 30

Ser Phe Lys Tyr Met Phe Lys Ile Ile Ile Tyr Val Ser Ala Tyr Cys
35 40 45

Arg Thr Ala Leu Arg Ala Thr Val Ser His
50 55

(2) INFORMATION FOR SEQ ID NO: 524:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

Asn Arg Thr Leu Leu Phe Leu Ile Leu Phe Val Leu Phe Gly Leu Gly
1 5 10 15

Tyr Gly Phe 



(2) INFORMATION FOR SEQ ID NO: 525:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

Met Phe Leu Leu Val Leu Ser Val Phe Cys Asp Phe Met Cys Ser Ile
1 5 10 15

Ala Pro Arg Cys His Ala Leu Ser Leu Val Ser Leu Arg Ala Gln His
20 25 30

Leu Ser Leu Phe Ile Thr Cys His
35 40 



(2) INFORMATION FOR SEQ ID NO: 526:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 57 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:



Met Leu Leu Phe Ile Leu Leu Thr Leu Ser Ser Gly Cys Arg Leu Leu
1 5 10 15

Val Ser Ser Trp Lys Thr Phe Leu Pro His Phe Ser Leu Pro Gly Pro
20 25 30

Arg Glu His Pro Glu Gly Ser Arg Thr Trp Phe Phe Arg Tyr Trp Glu
35 40 45

Pro Gly Ala His Cys Leu His Cys Ala
50 55



(2) INFORMATION FOR SEQ ID NO: 527: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

Ala Arg Leu Leu Leu Phe Leu Ser Ser Val His Pro Ser Ile Met Pro
1 5 10 15

Ser Cys Asn Gln Leu
20



(2) INFORMATION FOR SEQ ID NO: 528: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

Met Ser Leu Thr Ser Ser Leu Thr Phe Leu Ser His Ile Leu Leu Leu
1 5 10 15

Pro Gln Lys Leu Gln Phe Leu Ser Trp Met Glu Arg Gln Gln Arg Cys
20 25 30

Thr Gly Val Ala Lys Tyr Ala
35



(2) INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 128 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 



Met Val Leu Arg Leu Ile Gln Leu Ile Phe Leu Ile Phe Phe Ile His
1 5 10 15

Ile Ile Ile Leu Leu Ile Pro Gly Ser Arg Pro Cys Gly Ser Trp Val
20 25 30

Asn Asp Arg Xaa Leu Gly Leu Arg Asp Val Thr His Leu Ile Tyr Leu
35 40 45

His Trp Val His Gly His Leu Pro Trp Cys His Pro Tyr Ile Gln Val
50 55 60

Glu Phe Ser Ala Leu Ile Glu Ser Thr Ala Gln Leu Gly Leu Pro Phe
65 70 75 80

Ser Trp Val Arg Val Ile His Pro Phe Leu Val Leu Pro Cys Leu Tyr
85 90 95

Ser Pro Gly Leu Lys Asn Gly Ile Phe Leu Phe Leu Leu Arg Ala Met
100 105 110

Pro Gly Gly Met Phe Pro Gly Asn Leu Glu Ala Phe Arg Val Pro Val
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 530:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:



Met Gly Ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser 
1 5 10 15

Leu Gly Gly Arg Cys Ile Gln Gly Arg Phe Ala Ser His Ser Lys Phe
20 25 30

Trp Gly Phe Gly Xaa Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr
35 40 45

Pro Pro Asp Gln Glu Pro Gln Lys Glu Thr Glu Pro Ala Thr Ser Ser
50 55 60

His Ala Arg Pro Trp Ala Arg Val Ile Gly Leu Arg Ile Trp Pro Gln
65 70 75 80 



Pro Asn 



(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 



Met Leu Leu Ser Val Ala Ile Phe Ile Leu Leu Thr Leu Val Tyr Ala
1 5 10 15

Tyr Trp Thr Met
20



(2) INFORMATION FOR SEQ ID NO: 532: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 



Asn Cys Glu Ile Leu Glu Tyr Cys Tyr Tyr Leu Thr Gln Leu Lys Ile
1 5 10 15

Ser Met Gly Lys Tyr Leu Ser Ile Pro Thr Val Leu Leu Lys Ile Ile
20 25 30

Arg Cys Ser Ile Thr Ala Val Ser Asp Ser Ser Thr Ser Trp Ala Ile
35 40 45

Lys Ala Gln Leu Lys Ile Glu Asn Lys Asp Leu Asp Asn Lys Thr Ala
50 55 60

Lys Gly Gly Gly Gln Glu Ala Leu Thr Cys Thr
65 70 75



(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533:

Met Phe Leu Met Arg Met His Leu Cys Phe Cys Lys Tyr Cys Cys Ser
1 5 10 15

Phe Ile Val Thr Pro Thr Ser Thr Ser Asn Thr Xaa Ser Tyr Leu Trp
20 25 30

Pro Trp Ile Ser Ala Ser Met Ala Gly Arg Gly Ser Xaa Trp Ala Cys
35 40 45

Thr Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu
50 55 60

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534:

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe 
1 5 10 15

Thr Leu Ser Ile Asn Gln Glu Lys Leu Ala Asn His Leu Ala Phe Arg
20 25 30

Ile Leu Phe Phe Ile Val Phe
35 



(2) INFORMATION FOR SEQ ID NO: 535:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 



Met Leu 
1 




(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 






Met Asp Gln Phe Lys Ile Phe Tyr Phe Leu Lys Ala Phe Phe Ala Cys
1 5 10 15

Cys Asn Val Gln Asp Pro Ser Pro Phe Met Gly Glu Thr Gly Ser Tyr
20 25 30

Leu Asn Ile Gly
35



(2) INFORMATION FOR SEQ ID NO: 537: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 

Met Phe Asp Phe Leu Ser Tyr Phe Lys Asp Leu Leu Ser Cys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 538:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:



Met Gly Phe Gly Phe Val Leu Asn Ile Phe Ser Phe Phe Leu Xaa Pro
1 5 10 15

Pro Leu

(2) INFORMATION FOR SEQ ID NO: 539:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 

Leu Leu Leu Trp Thr Leu Leu Ala Xaa Tyr Xaa 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 540: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:

Met Ala Ala Gln Lys Asp Gln Gln Lys Asp Ala Glu Ala Glu Gly Leu
1 5 10 15

Ser Gly Thr Thr Leu Leu Pro Lys Leu Ile Pro Ser Gly Ala Gly Arg
20 25 30

Glu Trp Leu Glu Arg Arg Arg Ala Thr Ile Arg Pro Trp Ser Thr Phe
35 40 45

Val Asp Gln Gln Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys
50 55 60

Gln Arg Leu Val Arg Asn Val Glu Tyr Tyr Gln Ser Asn Tyr Val Phe
65 70 75 80

Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val Thr Ser Pro Met Leu
85 90 95

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Xaa
100 105 



(2) INFORMATION FOR SEQ ID NO: 541:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:



Phe Val Phe Leu Gly Leu Ile Leu Tyr Cys Val Val Thr Ser Pro Met
1 5 10 15

Leu Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr Ile Leu Tyr
20 25 30

Leu Arg Thr Leu Glu Ser Lys Leu Val Leu Phe Gly Arg Glu Val Ser
35 40 45

Pro Ala His Gln Tyr Ala Leu Ala Gly Gly Ile Ser Phe Pro Phe Phe
50 55 60

Trp Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr
65 70 75 80

Leu Val Val Ile Gly Ser His Ala Ala Phe His Gln Ile Glu Ala Val
85 90 95

Asp Gly Glu Glu Leu Gln Met Glu Pro Val
100 105



(2) INFORMATION FOR SEQ ID NO: 542: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

Met Asp Arg Phe Thr Val Ala Gly Val Leu Pro Asp Ile Glu Gln Phe
1 5 10 15

Phe Asn Ile Gly Asp Ser Ser Ser Gly Leu Ile Gln Thr Val Phe Ile
20 25 30

Ser Ser Tyr Met Val Leu Ala Pro Val Phe Gly Tyr Leu Gly Asp Arg
35 40 45

Tyr Asn Arg Lys Tyr Leu Met Cys Gly Gly Ile Ala Phe Trp Ser Leu
50 55 60

Val Thr Leu Gly Ser Ser Phe Ile Pro Gly Glu His Phe Trp Leu Leu
65 70 75 80

Leu Leu Thr Arg Gly Leu Val Gly Val Gly Glu Ala Ser Tyr Ser Thr
85 90 95

Ile Ala Pro Thr Leu Ile Ala Asp Leu Phe Val Ala Asp Gln Arg Thr
100 105 110

Gly Cys Ser Ala Ser Ser Thr Leu Pro Phe Arg Trp Ala Val Val Trp
115 120 125

Ala Thr Leu Gln Ala Pro Lys Xaa
130 135 



(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 424 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

Met Ala Gly Asp Trp His Trp Ala Leu Arg Val Thr Pro Gly Leu Gly
1 5 10 15

Val Val Ala Val Leu Leu Leu Phe Leu Val Val Arg Glu Pro Pro Arg
20 25 30

Gly Ala Val Glu Arg His Ser Asp Leu Pro Pro Leu Asn Pro Thr Ser
35 40 45

Trp Trp Ala Asp Leu Arg Ala Leu Ala Arg Asn Pro Ser Phe Val Leu
50 55 60

Ser Ser Leu Gly Phe Thr Ala Val Ala Phe Val Thr Gly Ser Leu Ala
65 70 75 80

Leu Trp Ala Pro Ala Phe Leu Leu Arg Ser Arg Val Val Leu Gly Glu
85 90 95

Thr Pro Pro Cys Leu Pro Gly Asp Ser Cys Ser Ser Ser Asp Ser Leu
100 105 110

Ile Phe Gly Leu Ile Thr Cys Leu Thr Gly Val Leu Gly Val Gly Leu
115 120 125

Gly Val Glu Ile Ser Arg Arg Xaa Arg His Ser Asn Pro Arg Ala Asp
130 135 140

Pro Leu Val Cys Ala Thr Gly Leu Leu Gly Ser Ala Pro Phe Leu Phe
145 150 155 160

Leu Ser Leu Ala Cys Ala Arg Gly Ser Ile Val Ala Thr Tyr Ile Phe
165 170 175

Ile Phe Ile Gly Glu Thr Leu Leu Ser Met Asn Trp Ala Ile Val Ala
180 185 190

Asp Ile Leu Leu Tyr Val Val Ile Pro Thr Arg Arg Ser Thr Ala Glu
195 200 205

Ala Phe Gln Ile Val Leu Ser His Leu Leu Gly Asp Ala Gly Ser Pro
210 215 220

Tyr Leu Ile Gly Leu Ile Ser Asp Arg Leu Arg Arg Asn Trp Pro Pro
225 230 235 240

Ser Phe Leu Ser Glu Phe Arg Ala Leu Gln Phe Ser Leu Met Leu Cys
245 250 255

Ala Phe Val Gly Ala Leu Gly Gly Ala Leu Ser Trp Ala Pro Xaa Ser
260 265 270

Ser Leu Arg Pro Thr Ala Gly Gly His Ser Cys Thr Cys Arg Ala Cys
275 280 285

Cys Thr Lys Gln Gly Pro Gln Thr Thr Gly Leu Trp Cys Pro Ser Gly
290 295 300

Ala Ala Pro Pro Ala Cys Pro Trp Pro Val Cys Ser Ser Glu Arg Leu
305 310 315 320

Pro Leu Thr Tyr Leu His Ile Cys His Ser Xaa Pro Trp Ala His Pro
325 330 335

Thr Lys Gly Leu Gly Leu Thr Pro Trp Pro Gly Pro Ala Ser Arg Gly
340 345 350

Thr Leu Gly Arg Val Pro Ala Pro Arg His Tyr Xaa Gly Ser Ser Gly
355 360 365

Glu Glu Val Gly Val Gln Glu Gly Asp Pro Ser Pro Gln Gly Xaa Pro
370 375 380

Gln Gly Leu Gly Ala Ile Cys Asn Gly Ile Lys Phe Val Ala Arg Pro
385 390 395 400

Gln Val Pro Ala Leu Val Phe Leu Trp Val Ala Ser Asp Leu Ala Pro
405 410 415

Arg Leu His Pro Arg Ala Pro Glu
420



(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544:

Met Phe Arg Phe Val Ile Cys Leu Phe Leu Trp Leu Val Leu Cys Arg
1 5 10 15

Asp Ser Thr Ser Ala Ser Arg Ile Ala Leu Tyr Tyr Arg Ile Val Phe
20 25 30

Leu Ile His Gln Cys Ser Ser
35 



(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 



Met Leu Pro Trp Xaa Ala Gln Leu Leu Asp Arg Thr Ile Gly Pro Leu
1 5 10 15

Tyr Leu Leu Phe Val Gln Phe Ser Pro Ala Phe Ser Arg Thr Ser Pro
20 25 30

Trp Arg Ser Pro Lys Asn Phe Arg Arg Leu Tyr Pro Pro Cys Thr Thr
35 40 45

Ser Gly Cys Ala Ala Arg Trp Leu Phe Ser
50 55



(2) INFORMATION FOR SEQ ID NO: 546:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:



Met Gly Leu Ser Val Leu Leu Pro Leu Cys Leu Leu Gly Pro Gly Arg 
1 5 10 15

Phe Thr Ser Gly Gln Lys Pro Leu Asp Thr Pro Gly Leu Gly Val Pro
20 25 30 

Phe 



(2) INFORMATION FOR SEQ ID NO: 547: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547:

Met Ala Lys Pro Gln Val Val Val Ala Pro Val Leu Met Ser Lys Leu
1 5 10 15

Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr
20 25 30

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr Pro Thr Leu Ser Glu
35 40 45

Tyr Val Gln Asp Phe Leu Asn His Leu Thr Glu Gln Pro Gly Ser Phe
50 55 60

Glu Thr Glu Ile Glu Gln Phe Ala Glu Thr Leu Asn Gly Cys Val Thr
65 70 75 80

Thr Asp Asp Ala Leu Gln Glu Leu Val Glu Leu Ile Tyr Gln Gln Ala
85 90 95

Thr Ser Ile Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr
100 105 110

Leu Ser His His Leu Thr Ile Ser Pro Gln Ser Gly Asn Phe Arg Gln
115 120 125

Leu Leu Leu Gln Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gln Ala
130 135 140

Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu
145 150 155 160

Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu Ile Lys Gly Thr Asn Gly
165 170 175

Gln Val Thr Arg Ala Asp Ile Leu Gln Val Gly Leu Arg Glu Leu Leu
180 185 190

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu Ile Cys Ala Val
195 200 205

Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu
210 215 220

Lys Gly Lys Met Asp Met Glu Glu Ile Ile Gln Arg Ile Glu Asn Val
225 230 235 240

Val Leu Asp Ala Asn Cys Ser Arg Asp Val Lys Gln Met Leu Leu Lys
245 250 255

Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser
260 265 270

Thr Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn
275 280 285

Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp
290 295 300

Pro Asp Tyr Gln Glu Lys Tyr Gln Glu Leu Leu Glu Arg Glu Asp Phe
305 310 315 320

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp
325 330 335

Pro Tyr Leu Asp Asp Ile Asp Asp Glu Met Asp Pro Glu Ile Glu Glu
340 345 350

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gln
355 360 365 



(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548: 

Met Leu Arg Leu Asp Ile Ile Asn Ser Leu Val Thr Thr Val Phe Met
1 5 10 15

Leu Ile Val Ser Val Leu Ala Leu Ile Pro Glu Thr Thr Thr Leu Thr
20 25 30

Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala
35 40 45

Asp Gly Ala Leu Ile Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro
50 55 60

Tyr Gln Lys Lys Pro Val His Glu Lys Lys Glu Val Leu
65 70 75



(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 






Met Leu Lys Gln Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser
1 5 10 15

His Cys Trp Gly Leu Pro Leu Ala Cys Gly Thr Phe Val Gln Gly His
20 25 30

Gln Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gln Gly Ala
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 550: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 168 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

Met Leu Leu Ser Leu Ala Ala Phe Ser Val Ile Ser Val Val Ser Tyr
1 5 10 15

Leu Ile Leu Ala Leu Leu Ser Val Thr Ile Ser Phe Arg Ile Tyr Lys
20 25 30

Ser Val Ile Gln Ala Val Gln Lys Ser Glu Glu Gly His Pro Phe Lys
35 40 45

Ala Tyr Leu Asp Val Asp Ile Thr Leu Ser Ser Glu Ala Phe His Asn
50 55 60

Tyr Met Asn Ala Ala Met Val His Ile Asn Arg Ala Leu Lys Leu Ile
65 70 75 80

Ile Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala
85 90 95

Val Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly Ile
100 105 110

Thr Leu Leu Ile Leu Ala Glu Leu Leu Ile Phe Ser Val Pro Ile Val
115 120 125

Tyr Glu Lys Tyr Lys Thr Gln Ile Asp His Tyr Val Gly Ile Ala Arg
130 135 140

Asp Gln Thr Lys Ser Ile Val Glu Lys Ile Gln Ala Lys Leu Pro Gly
145 150 155 160

Ile Ala Lys Lys Lys Ala Glu Xaa
165 



(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

Ser Val Pro Phe His Leu Leu Val Val Leu Arg Ser Arg Ala Val Arg 
1 5 10 15

Ala Arg Arg Arg Arg Glu Pro Arg Ser Leu Pro Arg Pro Gly Asp Glu
20 25 30

Glu Leu Gln Leu Leu Leu Cys Gly Ala Arg Ser Asp Phe Leu Glu Arg
35 40 45

Cys Glu Glu Asp Trp Val Cys Leu Trp His His Ala Asp His Ala Ala
50 55 60

Phe Pro Gly Ser Phe Gln Cys His Gln Cys Gly Phe Leu Pro His Pro
65 70 75 80

Gly Ser Ser Leu Cys His His Gln Leu Gln Asp Leu Gln Val Arg His
85 90 95

Pro Ser Cys Thr Glu Val Arg Arg Arg Pro Ser Ile Gln Ser Leu Pro
100 105 110 

Gly Arg Arg His Tyr Ser Val Leu Arg Ser Phe Pro 
115 120 



(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

Met Val His Leu Leu Val Leu Ser Gly Ala Trp Gly Met Gln Met Trp
1 5 10 15

Val Thr Phe Val Ser Gly Phe Leu Leu Phe Arg Ser Leu Pro Arg His
20 25 30

Thr Phe Gly Leu Val Gln Ser Lys Leu Phe Pro Phe Tyr Phe His Ile
35 40 45

Ser Met Gly Cys Ala Phe Ile Asn Leu Cys Ile Leu Ala Ser Gln His
50 55 60

Ala Trp Ala Gln Leu Thr Phe Trp Glu Ala Ser Gln Leu Tyr Leu Leu
65 70 75 80

Phe Leu Ser Leu Thr Leu Ala Thr Val Asn Ala Arg Trp Leu Glu Pro
85 90 95

Arg Thr Thr Ala Ala Met Trp Ala Leu Gln Thr Val Glu Lys Glu Arg
100 105 110

Gly Leu Gly Gly Glu Val Pro Gly Ser His Gln Gly Pro Asp Pro Tyr
115 120 125

Arg Gln Leu Arg Glu Lys Asp Pro Lys Tyr Ser Ala Leu Arg Gln Asn
130 135 140

Phe Phe Arg Tyr His Gly Leu Ser Ser Leu Cys Asn Leu Gly Cys Val
145 150 155 160

Leu Ser Asn Gly Leu Cys Leu Ala Gly Leu Ala Leu Glu Ile Arg Ser
165 170 175

Leu



(2) INFORMATION FOR SEQ ID NO: 553:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553:



Met Ala Phe Ile Leu Leu Phe Tyr Cys Leu Met Thr Phe Leu Ser Leu
1 5 10 15

Glu Gln Asn Ser Ala Thr Val Glu Pro Ser Ser His Glu Ile Leu His
20 25 30

Leu Leu Gln Asn Cys Phe Glu Leu Leu Arg Thr Ser Thr Ser Gln Cys
35 40 45

Thr Glu Gly Ile Pro Cys Ala Lys Ile Pro Glu Trp Val Thr His Leu
50 55 60

Thr Trp Gln Thr Leu Lys Asn Ser
65 70 



(2) INFORMATION FOR SEQ ID NO: 554:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554:

Val Leu Arg Ile Ile Cys Leu Trp Pro Cys Gly Thr Thr Leu Pro Leu
1 5 10 15

Val Glu Lys Ala His Asp Ser His Ser Ala Asp Pro Val Cys Pro Gly
20 25 30

Leu Thr Ala His Leu Pro Val Leu Leu Tyr Val Gln Leu
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 



Met Lys His Ala Asp Pro Arg Ile Gln Gly Tyr Pro Leu Met Gly Ser
1 5 10 15

Pro Leu Leu Met Thr Ser Ile Leu Leu Thr Tyr Val Tyr Phe Val Leu
20 25 30

Ser Leu Gly Pro Arg Ile Met Ala Asn Arg Lys Pro Phe Gln Leu Arg
35 40 45

Gly Phe Met Ile Val Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr
50 55 60

Ile Val Tyr Glu Phe Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp
65 70 75 80

Arg Cys Asp Pro Gln Asp Cys Thr Leu Gly Gln Cys Pro Ser Val Pro
85 90 95

Ser Pro Xaa Thr Pro Val Thr Lys Ala Tyr Val Val Arg Thr Glu Gln
100 105 110

Gly Thr Gly Pro Pro Leu Pro Thr Ala Ala Leu Gln Gly Pro Arg Leu
115 120 125

Trp Phe Leu Thr His Phe Pro Arg Ala Ala Pro Gly Met Trp Pro His
130 135 140

Cys Cys Leu Pro Leu Gln Ser Trp Gly Leu Lys Gly Leu Tyr Ser Tyr
145 150 155 160

Phe Pro Leu Pro Ala Leu Lys Leu Gly Arg Gly Ala Leu Arg Ala Gly
165 170 175

Pro Thr Lys Gly Leu Val Ala Phe Phe Leu Thr Gln Lys Arg Ser Ala
180 185 190

Ile Met Ser Leu Trp Thr Gln Ser His Ser Ser Thr Pro His Thr Glu
195 200 205

Ala Val Ala Ser Gly Pro Lys Val Arg Val Gly Gly Gly Leu Gly Ile
210 215 220

Gln Pro Val Glu Ala Ala Tyr Ser Thr Cys Val Leu Ile Lys Ser Asp
225 230 235 240

Arg Gly Asn His Glu Lys Lys Lys Lys Lys Lys
245 250



(2) INFORMATION FOR SEQ ID NO: 556: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: 

Gly Leu Ala Gly Leu Cys Gly Gln Leu Ser Ser Pro Ala Leu Cys Val
1 5 10 15



Asn Arg Leu 



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 



Met Ile Thr Glu Lys Trp Gly Leu Asn Met Glu Tyr Cys Arg Gly Gln
1 5 10 15

Ala Tyr Ile Xaa Ser Ser Gly Phe Ser Ser Lys Met Lys Val Val Ala
20 25 30

Ser Arg Leu Leu Glu Lys Tyr Pro Gln Ala Ile Tyr Thr Leu Cys Ser
35 40 45

Ser Cys Ala Leu Asn Met Trp Leu Ala Lys Ser Val Pro Val Met Gly
50 55 60

Val Ser Val Ala Leu Gly Thr Ile Glu Glu Val Cys Ser Phe Phe His
65 70 75 80

Arg Ser Pro Gln Leu Leu Leu Glu Leu Asp Asn Val Ile Ser Val Leu
85 90 95

Phe Gln Asn Ser Lys Glu Arg Gly Lys Glu Leu Lys Glu Ile Cys His
100 105 110

Ser Gln Trp Thr Gly Arg His Asp Ala Phe Glu Ile Leu Val Glu Leu
115 120 125

Leu Gln Ala Leu Val Leu Cys Leu Asp Gly Ile Asn Ser Asp Thr Asn
130 135 140

Ile Arg Trp Asn Asn Tyr Ile Ala Gly Arg Ala Phe Val Leu Cys Ser
145 150 155 160

Ala Val Ser Asp Phe Asp Phe Ile Val Thr Ile Val Val Leu Lys Asn
165 170 175

Val Leu Ser Phe Thr Arg Ala Phe Gly Lys Asn Leu Gln Gly Gln Thr
180 185 190

Ser Asp Val Phe Phe Ala Ala Gly Ser Leu Thr Ala Val Leu His Ser
195 200 205

Leu Asn Glu Val Ile Gly Lys Tyr Xaa
210 215 



(2) INFORMATION FOR SEQ ID NO: 558:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

Leu Leu Lys Val Leu Cys Ile Leu Pro Val Met Lys Val Glu Asn Glu
1 5 10 15

Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg Asn Thr
20 25 30

Leu Thr Asp Gln Arg Ser Ser Asn Leu Ala Leu Leu Asn Ile Asn Phe
35 40 45

Asp Ile Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr Ile Lys Leu
50 55 60 

Tyr Thr Ser Lys Ser Glu Leu Pro Thr Asp Asn Ser Glu Thr Val Glu 
65 70 75 80 

Asn Thr 



(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

Met Val Leu Ile Leu Leu Asn Leu Leu Leu Gly Gln Phe Ser Cys Met
1 5 10 15

Ser Pro Ala Ser His His Cys His Pro Leu Pro Thr Glu Met Pro Cys
20 25 30

Ser Ser Asp Trp Gly Phe Asp Ser His Thr Val Tyr Pro Ser Cys Val
35 40 45

Asp Ala Leu Leu Pro Lys Pro Ser Ala Asn Ser Phe Pro Asn Gly Ser
50 55 60

Cys His Cys Gln Gly Leu Tyr Asn Gln Gln Gln Gln Asn Leu His Ala
65 70 75 80

Ala Glu Gly Pro Ala Ser Leu Arg Cys Asn Lys Tyr Val Ser Thr 
85 90 95 



(2) INFORMATION FOR SEQ ID NO: 560:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 



Met Ile Pro Ala Tyr Ser Lys Asn Arg Ala Tyr Ala Ile Phe Phe Ile
1 5 10 15

Val Phe Thr Val Ile Gly Asp Ala Pro Gly Ala Val Leu Ser Cys Ala
20 25 30

Gly His Pro Cys Val Gly Phe Ala Ala Val Leu Val Ala Pro Leu Thr
35 40 45



Val Ala Val Ser Ser Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561:



Met Glu Val Pro Pro Pro Ala Pro Arg Ser Phe Leu Cys Arg Ala Leu 
1 5 10 15

Cys Leu Phe Pro Arg Val Phe Ala Ala Glu Ala Val Thr Ala Asp Ser
20 25 30

Glu Val Leu Glu Glu Arg Gln Lys Arg Leu Pro Tyr Val Pro Glu Pro
35 40 45

Tyr Tyr Pro Glu Ser Gly Trp Asp Arg Leu Arg Glu Leu Phe Gly Lys
50 55 60

Asp Thr Val Asn Thr Ser Leu Asn Val Tyr Arg Asn Lys Asp Ala Leu
65 70 75 80

Ser His Phe Val Ile Ala Gly Ala Val Thr Gly Ser Leu Phe Arg Ile
85 90 95

Asn Val Gly Leu Arg Gly Trp Trp Leu Val Ala Xaa
100 105



(2) INFORMATION FOR SEQ ID NO: 562: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562:

Met Asn Trp Gly Leu Ser Ile Trp Leu His Tyr Tyr Glu Lys Lys Lys
1 5 10 15

Glu Gln Val Phe Leu Val Ile Leu Ala His Val Val Arg Arg Cys Ala
20 25 30

Ser Asp Gly Ile Leu Gln Phe Glu Ser Ser Leu Leu Lys Met Arg Arg
35 40 45 

Ala Pro 
50 




(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 

Met Val Lys Val Cys Asn Asp Ser Asp Arg Trp Ser Leu Ile Ser Leu
1 5 10 15

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu
20 25 30

Arg Arg Gln Phe Glu Phe Ser Val Asp Ser Phe Gln Ile Lys Leu Asp
35 40 45

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Thr
50 55 60

Phe His Pro Thr Ile Ile Gly Glu Ser Val Tyr Gly Asp Phe Gln Glu
65 70 75 80

Ala Phe Asp His Leu Cys Asn Lys Ile Ile Ala Thr Arg Asn Pro Glu
85 90 95

Glu Ile Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg
100 105 110

Gly Phe Arg Pro Ala Ser Asp Glu Ile Lys Thr Leu Gln Arg Tyr Met
115 120 125

Cys Ser Arg Phe Phe Ile Asp Phe Ser Asp Ile Gly Glu Gln Gln Arg
130 135 140

Lys Leu Glu Ser Tyr Leu Gln Asn His Phe Val Gly Leu Glu Asp Arg
145 150 155 160

Lys Tyr Glu Tyr Leu Met Thr Leu His Gly Val Val Asn Glu Ser Thr
165 170 175

Val Cys Leu Met Gly His Glu Arg Arg Gln Thr Leu Asn Leu Ile Thr
180 185 190

Met Leu Ala Ile Arg Val Leu Ala Asp Gln Asn Val Ile Pro Asn Val
195 200 205

Ala Asn Val Thr Cys Tyr Tyr Gln Pro Ala Pro Tyr Val Ala Asp Ala
210 215 220

Asn Phe Ser Asn Tyr Tyr Ile Ala Gln Val Gln Pro Val Phe Thr Cys
225 230 235 240

Gln Gln Gln Thr Tyr Ser Thr Trp Leu Pro Cys Asn Xaa
245 250



(2) INFORMATION FOR SEQ ID NO: 564: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564:



Met Ser Phe Leu Met Trp Leu Met Ser Leu Ala Ile Thr Ser Gln Pro
1 5 10 15

Pro Met



(2) INFORMATION FOR SEQ ID NO: 565:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565:



Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gln Ile Phe
1 5 10 15

Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg Ile Ile Leu Gly
20 25 30

Ala Asn Ala Ile Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser
35 40 45

Ala Ser Phe Trp Ala Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr
50 55 60

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Phe
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 

His Leu Lys Asp Val Ile Leu Leu Thr Ala Ile Val Gln Val Leu Ser
1 5 10 15

Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu Ala Pro Gly Arg
20 25 30

Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro Trp Phe Thr Ala
35 40 45

Asp Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys Arg Gln Arg Arg
50 55 60

Gln Glu Arg Arg Gln Met Lys Arg Leu
65 70 



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 

Met Asp Cys Pro Ala Leu Pro Pro Gly Trp Lys Lys Glu Glu Val Ile
1 5 10 15

Arg Lys Ser Gly Leu Ser Ala Gly Lys Ser Asp Val Tyr Tyr Phe Ser
20 25 30

Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gln Leu Ala Arg Tyr Leu
35 40 45

Gly Asn Thr Val Asp Leu Ser Ser Phe Asp Phe Arg Thr Gly Lys Met
50 55 60

Met Pro Ser Lys Leu Gln Lys Asn Lys Gln Arg Leu Arg Asn Asp Pro
65 70 75 80

Leu Asn Gln Asn Lys Gly Lys Pro Asp Leu Asn Thr Thr Leu Pro Ile
85 90 95

Arg Gln Thr Ala Ser Ile Phe Lys Gln Pro Val Thr Lys Val Thr Asn
100 105 110

His Pro Ser Asn Lys Val Lys Ser Asp Pro Gln Arg Met Asn Glu Gln
115 120 125

Pro Arg Gln Leu Phe Trp Glu Lys Arg Leu Gln Gly Leu Ser Ala Ser
130 135 140

Asp Val Thr Glu Gln Ile Ile Lys Thr Met Glu Leu Pro Lys Gly Leu
145 150 155 160

Gln Gly Val Gly Pro Gly Ser Asn Asp Glu Thr Leu Leu Ser Ala Val
165 170 175

Ala Ser Ala Leu His Thr Ser Ser Ala Pro Ile Thr Gly Gln Val Ser
180 185 190

Ala Ala Val Glu Lys Asn Pro Ala Val Trp Leu Asn Thr Ser Gln Pro
195 200 205

Leu Cys Lys Ala Phe Ile Val Thr Asp Glu Asp Ile Arg Lys Gln Glu
210 215 220

Glu Arg Val Gln Gln Val Arg Lys Lys Leu Glu Glu Ala Leu Met Ala
225 230 235 240

Asp Ile Leu Ser Arg Ala Ala Asp Thr Glu Glu Met Asp Ile Glu Met
245 250 255

Asp Ser Gly Asp Glu Ala Xaa
260



(2) INFORMATION FOR SEQ ID NO: 568: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568:

Met Met Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gln Ala
1 5 10 15

Leu Arg Gln Ser Gln Gly Lys Ser Pro Leu Leu Trp Lys Arg Thr Leu
20 25 30

Leu Phe Gly Leu Thr His Leu Asn Pro Ser Ala Lys Leu Leu Leu Ser
35 40 45

Gln Met Lys Thr Ser Gly Asn Arg Lys Ser Glu Tyr Ser Lys Tyr Ala
50 55 60

Arg Asn Trp Lys Lys His
65 70



(2) INFORMATION FOR SEQ ID NO: 569:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:

Met Pro Val Thr Ser Lys Arg Thr Leu Phe Phe Pro Asp Pro Cys Ser
1 5 10 15

Tyr Asp Thr Pro Pro Pro Asp Cys His Cys His Ser Phe Arg Ala Glu
20 25 30

Leu Leu



(2) INFORMATION FOR SEQ ID NO: 570:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570:

Met Asn Ser Arg Gly Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu
1 5 10 15

Leu His Ile Val Leu Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala
20 25 30

Trp Thr Leu Thr Asn Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu
35 40 45

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ser
50 55 60

Lys Ala Pro Asn Ser Leu Gly Thr Thr Gly Leu Trp Ser Thr Val Tyr
65 70 75 80

Ile Phe Thr Glu Val Phe His Asn Phe Ser Asn Asn Ser Ile Phe Ser
85 90 95

Gly Lys Phe Leu Tyr Glu Val Xaa
100



(2) INFORMATION FOR SEQ ID NO: 571: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571:

Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu Leu His Ile Val Leu
1 5 10 15

Leu Ser Ile Pro Phe Phe Ser Val Pro Val Ala Trp Thr Leu Thr Asn
20 25 30

Ile Ile His Asn Leu Gly Met Tyr Val Phe Leu His Ala Val Lys Gly
35 40 45

Thr Pro Phe Glu Thr Pro Asp Gln Gly Lys Ala Arg Leu Leu Thr His
50 55 60

Trp Glu Gln Leu Asp Tyr Gly Val Gln Phe Thr Ser Ser Arg Lys Phe
65 70 75 80

Phe Thr Ile Ser Pro Ile Ile Leu Tyr Phe Leu Ala Ser Phe Tyr Thr
85 90 95

Lys Tyr Asp Pro Thr His Phe Ile Leu Asn Thr Ala Ser Leu Leu Ser
100 105 110

Val Leu Ile Pro Lys Met Pro Gln Leu His Gly Val Arg Ile Phe Gly
115 120 125

Ile Asn Lys Tyr
130



(2) INFORMATION FOR SEQ ID NO: 572: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572:

Met Asn Lys Trp Ile Cys Glu Met His Cys Tyr Leu Val Leu Leu Ser
1 5 10 15

Val Cys Ser Pro Ser Ala Leu Arg Arg Val Arg His Thr Leu Ser Arg
20 25 30



(2) INFORMATION FOR SEQ ID NO: 573:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573:

Met Pro Val Leu Ser Leu Leu Cys Thr Leu Ile Val Ser Phe Gln Ser
1 5 10 15

Ala Asp Ser Cys Glu Val Phe Leu Asn Cys Ser Leu
20 25 



(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574:

Met Lys Val Ser Thr Met Leu Trp Phe Leu Cys Trp Glu Gln Ser His
1 5 10 15

Phe Leu Arg Glu Trp Glu Asp Leu Ser Thr Phe Leu Ile Leu Ile Gln
20 25 30

Met Glu Cys Gln Tyr Gly Asn Ser
35 40



(2) INFORMATION FOR SEQ ID NO: 575:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575:

Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser
1 5 10 15

Ala Gly Val J^jn Phe Ile Leu Ala Leu Pro Leu Leu Xaa Leu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 576: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576:

Met Lys Arg Gly Cys Leu Gly Leu Leu Phe Phe Ser Cys Cys Ser Ser
1 5 10 15

Ala Pro Thr Met Leu Leu Cys Asp Tyr Leu Asn Trp Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 577: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 92 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577:

Met Lys Leu Leu Leu Gly Ile Ala Leu Leu Ala Tyr Val Ala Ser Val
1 5 10 15

Trp Gly Asn Phe Val Asn Met Arg Ser Ile Gln Glu Asn Gly Glu Leu
20 25 30

Lys Ile Glu Ser Lys Ile Glu Glu Met Val Glu Pro Leu Arg Glu Lys
35 40 45

Ile Arg Asp Leu Glu Lys Ser Phe Thr Gln Lys Tyr Pro Pro Val Lys
50 55 60

Phe Leu Ser Glu Lys Asp Arg Lys Arg Ile Leu Xaa Asn Arg Arg Arg
65 70 75 80

Xaa Val Arg Gly Leu Pro Ser Xaa Leu Thr Asn Ser
85 90



(2) INFORMATION FOR SEQ ID NO: 578:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578:

Met Lys Phe Ser Leu Val Leu Leu Ile Lys Ile Ile Ser Phe Glu Arg
1 5 10 15

Leu Leu Ile Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn Ile Trp Arg
20 25 30

Arg Val Met Val Asn Leu Asn Ile Leu Phe
35 40



(2) INFORMATION FOR SEQ ID NO: 579:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579:

Leu Ala Gln Glu Cys Pro Pro His Ile Pro Ser Ser Phe Phe Leu Val
1 5 10 15

Lys Leu Leu Phe Ile Pro Trp Leu Ala Ser Leu Leu Pro Pro Leu Ser
20 25 30

Thr Phe Thr Ser Asp Phe Tyr Phe Met Glu Phe Gly Ile Glu Val Lys
35 40 45

Leu Gln Gln Cys Arg Gln His Gln Val Leu Gln Glu Lys Asn Thr Lys
50 55 60

Lys Phe Asn Lys Lys Lys
65 70



(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

Met Leu Arg Leu Leu Leu Leu Val Ala Phe Ala Leu Val Val Val Leu 
1 5 10 15

Phe His Val Leu Leu Ala Pro Ile Thr Ala Leu Phe His Thr His Phe
20 25 30

Tyr Asp Arg Leu Gln Asp Ala Gly Ser Arg Trp Pro Glu Leu Tyr Leu
35 40 45

Tyr Ser Arg Ala Asp Glu Val Val Leu Ala Arg Asp Ile Glu Arg Met
50 55 60 

Val Glu Ala Arg Leu Ala Arg Arg Val Leu Ala Arg Ser Val Asp Phe 
65 70 75 80 

Val Ser Ser Ala His Val Ser His Leu Arg Asp Tyr Pro Thr Tyr Tyr 
85 90 95 

Thr Ser Leu Cys Val Asp Phe Met Arg Asn Cys Val Arg Cys 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 

Met Phe Lys Leu Glu Glu Cys Gly Lys Thr Thr Phe Leu Leu Ser Met 
1 5 10 15

Ala Leu Tyr Phe Trp Trp Ile Val Gln Thr Thr Lys Gly Cys
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 582:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 71 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582:

Met Glu Ser Asp Ala Leu Leu Leu Thr Ile Phe Trp Ile Ile Ala Arg
1 5 10 15

Ser Ser Val Arg Ser Val Gly Lys Ser Ser Gln Arg Ser Phe Thr Thr
20 25 30

Ile Thr Gln Leu Arg Ser Thr His Thr Gly Pro Ser Arg Arg Ser Tyr
35 40 45

Leu Ile Trp Trp Asn Gly Gly Pro Lys Arg Thr Ile Ser Tyr Val Ser
50 55 60 

Arg Arg Phe Arg Ser Phe Arg 
65 70 



(2) INFORMATION FOR SEQ ID NO: 583:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583:

Val Gly Leu Phe Gln Pro Lys Thr Phe Gln Val Pro Val Thr Asp Leu
1 5 10 15

Tyr Ile Phe Ile Lys Ile Tyr Ser Glu Ile Gly Pro Ile Met His Val
20 25 30

Leu Cys Pro Gly Tyr Ser Gln Ser Pro Ser Thr Pro Pro Trp Thr
35 40 45



(2) INFORMATION FOR SEQ ID NO: 584:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584:

Met Trp Phe Gly Ser Asp Arg Ser Asp Leu Arg Ile Gly Thr Ala Phe
1 5 10 15

Leu Phe Asp Leu Val Cys Asp Leu Cys Ile His Ala Trp Lys Pro Pro
20 25 30

Gly Leu Val Arg Phe Ser Phe
35



(2) INFORMATION FOR SEQ ID NO: 585:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585:

Met Leu Asn Thr Ala Ser Leu Asn Leu Pro Trp Lys Val Gln Leu Phe
1 5 10 15

Ala His Ala



(2) INFORMATION FOR SEQ ID NO: 586:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586:

Met Ser Ala Cys Leu Leu Leu Phe Leu Ala Phe Ser Trp Lys Arg Lys
1 5 10 15

Gly Leu Trp Ser Gly Pro Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 587: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587:

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser
1 5 10 15

Leu Leu Pro Val Ile Leu Ala Val Phe Pro Asp Ser Ala Gln Ile Val
20 25 30

Pro Leu Leu Lys Pro Ile Pro Arg Pro Gln Pro Glu Val Ile Phe Pro
35 40 45

Ser Ser Glu Leu Leu Glu Gln Leu Leu Ser Val Gln Phe Val Trp Gln
50 55 60 

Ala His Thr Val Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588:

Met Gly Pro Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu
1 5 10 15

Leu Met Met Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu
20 25 30

Ser Thr Cys Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu
35 40 45

Asn Val Gly Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr
50 55 60 

Arg Lys Lys Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 589: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589:

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe
1 5 10 15

Ala Leu Val Gly Leu Ala Lys Leu Ser Glu Glu Ile Ser Ala Pro Val
20 25 30

Ser Glu Arg Met Asn Ala Leu Phe Val Gln Phe Ala Glu Val Phe Pro
35 40 45

Leu Lys Val Phe Gly Tyr Gln Pro Asp Pro Leu Asn Tyr Gln Ile Ala
50 55 60

Val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro
65 70 75 80

Pro Met Leu Gln Glu Ile Ser Asn Leu Phe Leu Ile Leu Leu Met Met
85 90 95

Gly Ala Ile Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys
100 105 110

Ile Pro Ala Ile Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly
115 120 125

Gln Leu Leu Ala Gln Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys
130 135 140

Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa
145 150 155



(2) INFORMATION FOR SEQ ID NO: 590:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590:

Met Pro Glu Thr Arg Leu Gly His Arg Gln Gln Phe Ala Val Phe His
1 5 10 15

Leu Xaa Pro Val Pro Pro Cys Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 591: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591:

Met Leu Thr Phe Leu Phe Ser Ala Cys Ala Thr Cys Leu Gly Lys Leu
1 5 10 15

Ala Ser Pro Leu Ala Pro Val Gly Pro Gln Gln Arg Gly Xaa Pro Pro
20 25 30 



Gly Pro Pro Leu Leu Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592:

Met Asp Pro Phe His Tyr Asp Tyr Gln Thr Leu Arg Ile Gly Gly Leu
1 5 10 15

Val Phe Ala Val Val Leu Phe Ser Val Gly Ile Leu Leu Ile Leu Ser
20 25 30

Arg Arg Cys Lys Cys Ser Phe Asn Gln Lys Pro Arg Ala Pro Gly Asp
35 40 45

Glu Glu Ala Gln Val Glu Asn Leu Ile Thr Ala Asn Ala Thr Glu Pro
50 55 60

Gln Lys Ala Glu Asn
65



(2) INFORMATION FOR SEQ ID NO: 593:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593:

Asn Leu Arg Val Arg Leu Gly Asp Val Ile Ser Ile Gln Pro Cys Pro
1 5 10 15

Asp Val Lys Tyr Gly Lys Arg Ile His Val Leu Pro Ile Asp Asp Thr
20 25 30

Val Glu Gly Ile Thr Gly Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr
35 40 45

Phe Leu Glu Ala Tyr Arg Pro Ile Arg Lys Gly Asp Ile Phe Leu Val
50 55 60

Arg Gly Gly Met Arg Ala Val Glu Phe Lys Val Val Glu Thr Asp Pro
65 70 75 80

Ser Pro Tyr Cys Ile Val Ala Pro Asp Thr Val Ile His Cys Glu Gly
85 90 95

Glu Pro Ile Lys Arg Glu Asp Glu Glu Glu Ser Leu Asn Glu Val Gly
100 105 110

Tyr Asp Asp Ile Gly Gly Cys Arg Lys Gln Leu Ala Gln Ile Lys Glu
115 120 125

Met Val Glu Leu Pro Leu Arg His Pro Ala Leu Phe Lys Ala Ile Gly
130 135 140

Val Lys Pro Pro Arg Gly Ile Leu Leu Tyr Gly Pro Pro Gly Thr Gly
145 150 155 160

Lys Thr Leu Ile Ala Arg Ala Val Ala Asn Glu Thr Gly Ala Phe Phe
165 170 175

Phe Leu Ile Asn Gly Pro Glu Ile Met Ser Lys Leu Ala Gly Glu Ser
180 185 190

Glu Ser Asn Leu Arg Lys Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro
195 200 205

Ala Ile Ile Phe Ile Asp Glu Leu Asp Ala Ile Ala Pro Lys Arg Glu
210 215 220

Lys Thr His Gly Glu Val Glu Arg Arg Ile Val Ser Gln Leu Leu Thr
225 230 235 240

Leu Met Asp Gly Leu Lys Gln Arg Ala His Val Ile Val Met Ala Ala
245 250 255

Thr Asn Arg Pro Asn Ser Ile Asp Pro Ala Leu Arg Arg Phe Gly Arg
260 265 270

Phe Asp Arg Glu Val Asp Ile Gly Ile Pro Asp Ala Thr Gly Arg Leu
275 280 285

Glu Ile Leu Gln Ile His Thr Lys Asn Met Lys Leu Ala Asp Asp Val
290 295 300

Asp Leu Glu Gln
305



(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594:

Met Gln Ile Lys Leu Leu Lys Ser Val Lys Thr Val Phe Ala Ile Thr
1 5 10 15

Leu Leu Val Leu Phe Leu 
20 


(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595:

Met Phe Pro Lys Phe Cys Pro Ile Leu Ser Leu Val Asp Phe Ile Ser
1 5 10 15 

His Arg Asp Lys Pro Glu Thr Glu 
20 



(2) INFORMATION FOR SEQ ID NO: 596: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:

Met Leu Ile Glu Cys Ala Trp Gln Leu Met Phe Leu Leu Leu Lys Val
1 5 10 15

Glu Gln Leu Gly Ile Leu Asp Lys
20



(2) INFORMATION FOR SEQ ID NO: 597:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:

Met 
1 


(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598:

Met Cys Ile Met Ser Ala Leu Val
1 5 



(2) INFORMATION FOR SEQ ID NO: 599:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599:

Met Phe Leu Val Trp Phe Phe Trp Gly Leu Ile Ser Ala Leu Ser Asn
1 5 10 15

Val His Thr Pro Ser Arg Leu Pro Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:

Met Xaa Gly Leu Ser Leu Ile Leu Thr Val Thr Leu Leu Ala Val Ser
1 5 10 15

Asp Ser Ala Ala Thr Cys Ile Val Ala Lys Gly
20 25 


(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 61 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601:

Met Trp Thr Arg Ser Ser Arg Cys Leu Leu Leu Cys Ile Pro Gly Xaa
1 5 10 15

Ser Arg Arg Arg Arg Ala Gly Ser Gly Met Lys Pro Arg Ser Trp Ser
20 25 30

Ala Trp Arg Pro Ser Gly Gly Thr Gly Thr Ser Ser Ser Gln Ser Ser
35 40 45

Thr Gln Ser Arg Thr Leu Ser Ala Thr Ala Ser Pro Ala
50 55 60



(2) INFORMATION FOR SEQ ID NO: 602:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:

Met Arg Glu Thr Ser Ile Arg Val Leu Leu Met Leu Pro Ala Leu Glu
1 5 10 15

Ser Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr
20 25 



(2) INFORMATION FOR SEQ ID NO: 603:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:

Met Pro Pro Lys Gln Glu Leu Gly Ser Gly Val Gly Glu Leu Ala Lys
1 5 10 15

Asn Ser Lys Arg Gln His Trp Asn His Arg Trp Lys Lys Tyr Leu Lys
20 25 30

Leu Ile Arg Trp Glu Asp Gly Leu Leu Leu Glu Gly Leu Leu Leu Val
35 40 45

Leu Glu His Cys Ala Thr Met Ala Trp Asp Cys Leu Met Arg Leu Glu
50 55 60

Leu Leu Lys Arg Leu
65



(2) INFORMATION FOR SEQ ID NO: 604:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604:

Lys Ile Val Tyr Ile Leu Gly Asn Pro Leu Lys Phe Asn Ser Arg Val
1 5 10 15

Ile His His Leu Val Leu Leu Gln
20



(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605:

Met Asn Leu His Gln Arg Arg Leu Leu Leu Ile Gly His Leu Met Thr
1 5 10 15

Leu Val Lys Ala Ser Lys Ser Phe Ser Phe Thr Glu Ile Thr Ser Ser
20 25 30

Arg Lys Lys
35



(2) INFORMATION FOR SEQ ID NO: 606:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606:

Leu Leu Gly Tyr Gly Leu Phe Gly His Cys Ile Val Leu Phe Ile Thr
1 5 10 15

Tyr Asn Ile His Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val
20 25 30

Gly Gly Leu Ser Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr
35 40 45

Val Gly Pro Thr Gln Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu
50 55 60

His Met Leu Phe Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Xaa
65 70 75 80

Glu Gly Ile Leu Asp Thr Leu Glu Gly Pro Asn Ile Pro Pro Ile Gln
85 90 95

Arg Val Pro Arg Asp Ile Pro Ala Met Leu Pro Ala Ala Arg Leu Pro
100 105 110

Thr Thr Val Leu Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gln
115 120 125

Ser His
130


(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607:

Met Leu Val Ile Phe Leu Phe Thr Ser Leu Leu Lys Ile Pro Ser Ser
1 5 10 15

Val Pro Gly Leu Ile Asn Val
20 



(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

Glu Leu Asp Tyr Ile Leu
1 5 



(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

Met Ala Pro Pro Gly Trp Gln Xaa Xaa Xaa Xaa Xaa Trp Leu Ala Cys
1 5 10 15

Pro Asp Arg Gly Glu Leu Ser Ser Arg Ser Pro Pro Cys Arg Leu Ala
20 25 30

Arg Trp Ala Glu Gly Asp Arg Glu Thr Arg Thr Cys Leu Leu Glu Leu
35 40 45

Ser Ala Gln Ser Trp Gly Gly Arg Phe Arg Arg Ser Ser Ala Val Ser
50 55 60

Ala Gly Ser Pro Ser Arg Leu His Phe Leu Pro Gln Pro Leu Leu Leu
65 70 75 80

Arg Ser Ser Gly Ile Pro Ala Ala Ala Thr Pro Trp Pro Gln Pro Ala
85 90 95

Gly Leu Pro Val Arg Pro Thr Pro Thr Arg Thr Gly Glu Glu Asp Arg
100 105 110

Thr Leu Asp Ile Ser Ile Cys Thr Glu Val Leu Ala Gly Thr Glu Gln
115 120 125

Pro Pro Pro Pro Arg Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg
130 135 140

Leu Glu Thr Leu Asp Gly Gly Gln Glu Asp Gly Ser Glu Ala Asp Arg
145 150 155 160

Gly Lys Leu Asp Phe Gly Ser Gly Leu Pro Pro Met Glu Ser Gln Phe
165 170 175

Gln Gly Glu Asp Arg Lys Phe Ala Pro Ser Asp Lys Ser Gln Pro Pro
180 185 190

Thr Thr Glu Arg Glu Gln Val Pro Val Ser Arg Ile Gln Thr Asp Leu
195 200 205

Thr Glu Ile Gly Ser Ser Met Arg Ser Pro Gly Val Ser Pro Arg Ile
210 215 220

Trp Leu Asp Phe Gln Ser Thr Xaa
225 230 



(2) INFORMATION FOR SEQ ID NO: 610: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610:

Met Val Leu Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr Ile Leu
1 5 10 15

Leu Leu Asn Met Leu Ile Ala Leu Met Xaa Arg Asp Arg Gln Gln Cys
20 25 30

Arg His



(2) INFORMATION FOR SEQ ID NO: 611:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611:

Met Val Phe Glu Gly Phe Ser Ser Ala Phe Cys Leu Ser Ser Thr Ala
1 5 10 15

Pro Thr Ser His Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 612: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612:

Gly Lys Lys Asn Gln Leu Leu Val Ile
1 5



(2) INFORMATION FOR SEQ ID NO: 613: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613:

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val
1 5 10 15

Gly Pro Cys Trp Pro Ser Thr Gly Asn Cys Cys Leu Cys
20 25



(2) INFORMATION FOR SEQ ID NO: 614:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614:

Met Ala Lys Arg Ser Pro Gly Gly Cys Gly Ser Gly Leu Ile Leu Leu
1 5 10 15

Cys Cys Gln Pro Cys Arg Pro Thr Ser Ser Ala Pro Met Arg
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615:

Ile Thr Ile Ala Ile Gln Met Ile Cys Leu Val Asn Xaa Glu Leu Tyr
1 5 10 15

Pro Thr Phe Val Arg Asn Xaa Gly Val Met Val Cys Ser Ser Leu Cys
20 25 30

Asp Ile Gly Gly Ile Ile Thr Pro Phe Ile Val Phe Arg Leu Arg Glu
35 40 45

Val Trp Gln Ala Leu Pro Leu Ile Leu Phe Ala Val Leu Gly Leu Leu
50 55 60

Ala Ala Gly Val Thr Leu Leu Leu Pro Glu Thr Lys Gly Val Ala Leu
65 70 75 80

Pro Glu Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys Pro
85 90 95

Lys Glu Asn Thr Ile Tyr Leu Lys Val Gln Thr Ser Glu Pro Ser Gly
100 105 110

Thr



(2) INFORMATION FOR SEQ ID NO: 616:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:

Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys Pro Lys Glu
1 5 10 15

Asn Thr 



(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617:

Pro Arg Val Arg Asn Ser Pro Glu Asp Leu Gly Leu Ser Leu Thr Gly
1 5 10 15

Asp Ser Cys Lys Leu
20



(2) INFORMATION FOR SEQ ID NO: 618:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618:

Gln Ala Asp Asp Leu Gln Ala Thr Val Ala Ala Leu Cys Val Leu Arg
1 5 10 15

Gly Gly Gly Pro Trp Ala Gly Ser Trp Leu Ser Pro Lys Thr Pro Gly
20 25 30

Ala Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg
35 40 45

Lys Arg Leu Leu
50



(2) INFORMATION FOR SEQ ID NO: 619: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619:

Glu Gln Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu Ala Xaa Xaa
1 5 10 15

Gly Ile Gly Leu Met Val Leu His Ala Glu Met Leu Trp Phe Gly Gly
20 25 30

Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr Leu Trp Leu
35 40 45

Ile Pro Ile Thr Phe Leu Thr Ile Gly Tyr Gly Asp Val Val Pro Gly
50 55 60

Thr Met Trp Gly Lys Ile Val Cys Leu Cys Thr Gly Val Met Gly Val
65 70 75 80

Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys Leu Glu Phe
85 90 95

Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp Ile Gln Tyr
100 105 110

Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gln Glu Ala Trp
115 120 125

Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala Ala Arg Xaa
130 135 140

His Gln Arg Xaa Leu Leu Ala Ala Ile Asn Ala Phe Arg Gln Val Arg
145 150 155 160

Leu Lys His Arg Lys Leu Arg Glu Gln Val Asn Ser Met Val Asp Ile
165 170 175

Ser Lys Met His Met Ile Leu Tyr Asp Leu Gln Gln Asn Leu Ser Ser
180 185 190

Ser His Arg Ala Leu Glu Lys Gln Ile Asp Thr Leu Ala Gly Lys Leu
195 200 205

Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro Arg Gln Leu
210 215 220

Pro Glu Pro Ser Gln Gln Ser Lys
225 230



(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 

Tyr Gln Ala His His Val Ser Arg Asn Lys Arg Gly Gln Val Val Gly
1 5 10 15

Thr Arg Gly Gly Phe Arg Gly Cys Thr Val Trp Leu Thr Gly Leu Ser 
20 25 30 

Gly Ala Gly Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621:

Leu Gln Cys Glu Ile Cys Gly Phe Thr Cys Arg Gln Lys Ala Ser Leu
1 5 10 15

Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gln Phe Ser
20 25 30

Cys Asn Ile Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala
35 40 45 

His Lys Ala Lys Ser His Pro Glu Val 
50 55 



(2) INFORMATION FOR SEQ ID NO: 622:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:

Ile Thr Ser Thr Asp Ile Leu Gly Thr Asn Pro Glu Ser Leu Thr Gln
1 5 10 15

Pro Ser Asp 



(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

Asn Ser Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser 
1 5 10 15

Lys Ser Tyr 



(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624:

Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys Ile Phe
1 5 10 15

Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met Asn Ser
20 25 30

Asp Ile Leu Gly Ala Thr Thr Glu Val Leu Ile Glu Asp Ser Asp Ser
35 40 45 

Ala Gly Pro 
50 



(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625:

Ile Gln Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala
1 5 10 15

His Pro Arg Tyr Leu Gln His His Ile Lys Tyr Gln His Leu Leu Lys
20 25 30

Lys Lys Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu
35 40 45

Gln Lys Gln Leu Leu Arg His Ala Lys His His Thr
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 626: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626:

Asp Gln Arg Asp Tyr Ile Cys Glu Tyr Cys Ala Arg Ala Phe Lys Ser
1 5 10 15

Ser His Asn Leu Ala Val His Arg Met Ile His Thr Gly Glu Lys
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 627: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:

Arg Ser Ser Arg Ser Lys Thr Gly Ser Leu Gln Leu Ile Cys Lys Ser
1 5 10 15

Glu Pro Asn Thr Asp Gln Leu Asp Tyr
20 25 



(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:

Leu Gln Cys Glu Ile Cys Gly Phe Thr Cys Arg Gln Lys Ala Ser Leu
1 5 10 15

Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gln Phe Ser
20 25 30

Cys Asn Ile Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala
35 40 45

His Lys Ala Lys Ser His Pro Glu Val Xaa Ile Thr Ser Thr Asp Ile
50 55 60

Leu Gly Thr Asn Pro Glu Ser Leu Thr Gln Pro Ser Asp Xaa Asn Ser
65 70 75 80

Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser Lys Ser
85 90 95

Tyr Xaa Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys
100 105 110

Ile Phe Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met
115 120 125

Asn Ser Asp Ile Leu Gly Ala Thr Thr Glu Val Leu Ile Glu Asp Ser
130 135 140

Asp Ser Ala Gly Pro Xaa Gln Arg Asp Tyr Ile Cys Glu Tyr Cys Ala
145 150 155 160

Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met Ile His
165 170 175

Thr Gly Glu Lys His Tyr Xaa
180 



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 



Gln Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His
1 5 10 15

Pro Arg Tyr Leu Gln His His Ile Lys Tyr Gln His Leu Leu Lys Lys
20 25 30

Lys Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gln
35 40 45

Lys Gln Leu Leu Arg His Ala Lys His His Thr Asp
50 55 60



(2) INFORMATION FOR SEQ ID NO; 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:

Pro Phe Lys Asp Asp Pro Arg Asp Glu Thr Tyr Lys Pro His Leu Glu
1 5 10 15

Arg Glu Thr Pro Lys Pro Arg Arg Lys Ser Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

Glu Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu
1 5 10 15

Met Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gln His Ala Val Val
20 25 30

Leu Tyr Leu His Asn His Ser Tyr Ala Ser Ile Gln Ser Asp Asp Leu
35 40 45

Trp Asp Ser Phe Asn Glu Val Thr Asn Gln Thr Leu Asp Val Lys Arg
50 55 60

Met Met Lys Thr Trp Thr Leu Gln Lys Gly Phe Pro Leu Val Thr Val
65 70 75 80

Gln Lys Lys Gly Lys Glu Leu Phe Ile Gln Gln Glu Arg Phe Phe Leu
85 90 95

Asn Met Lys Pro Glu Ile Gln Pro Ser Asp Thr Arg Tyr Met
100 105 110



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 

Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg Pro Thr Gly Gln Gln
1 5 10 15

Leu Glu Ser Leu Gly Leu Leu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:

Val His Arg Glu Glu Ala Ser Cys Tyr Cys Gln Ala Glu Pro Ser Gly
1 5 10 15

Asp Leu 



(2) INFORMATION FOR SEQ ID NO: 634:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 

Arg Pro Ala Leu Arg Gln Ala Gly Gly Gly Thr Arg Glu Pro Arg Gln
1 5 10 15

Lys Arg Trp Ala Gly Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635:

Ala Val Asn Phe Arg Pro Gln Arg Ser Gln Ser Met
1 5 10



(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636:

Met Ile Thr Asp Val Gln Leu Ala Ile Phe Ala Asn Met Leu Gly Val
1 5 10 15 

Ser Leu Phe Leu Leu Val Val Leu Tyr His Tyr Val Ala Val Asn Asn 
20 25 30 

Pro Lys Lys Gln Glu
35

(2) INFORMATION FOR SEQ ID NO: 637:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

Glu Glu Met Ala Asp Ser Val Lys Thr Phe Leu Gln Asp Leu Ala Arg
1 5 10 15

Gly Ile Lys Asp Ser Ile Trp Gly Ile Cys Thr Ile Ser Lys Leu Asp
20 25 30

Ala Arg Ile Gln Gln Lys Arg Glu Glu Gln Arg Arg Arg Arg Ala Ser
35 40 45

Ser Val Leu Ala Gln Arg Arg Ala Gln Ser Ile Glu Arg Lys Gln Glu
50 55 60

Ser Glu Pro Arg Ile Val Ser Arg Ile Phe Gln Cys Cys Ala Trp Asn
65 70 75 80

Gly Gly Val Phe Trp Phe Ser Leu Leu Leu Phe Tyr Arg Val Phe Ile
85 90 95

Pro Val Leu Gln Ser Val Thr Ala Arg Ile Ile Gly Asp Pro Ser Leu
100 105 110

His Gly Asp Val Trp Ser Trp Leu Glu Phe Phe Leu Thr Ser Ile Phe
115 120 125

Ser Ala Leu Trp Val Leu Pro Leu Phe Val Leu Ser Lys Val Val Asn
130 135 140

Ala Ile Trp Phe Gln Asp Ile Ala Asp Leu Ala Phe Glu Val Ser Gly
145 150 155 160

Arg Lys Pro His Pro Phe Pro Ser Val Ser Lys Ile Ile Ala Asp Met
165 170 175

Leu Phe Asn Leu Leu Leu Gln Ala Leu Phe Leu Ile Gln Gly Met Phe
180 185 190

Val Ser Leu Phe Pro Ile His Leu Val Gly Gln Leu Val Ser Leu Leu
195 200 205

His Met Ser Leu Leu Tyr Ser Leu Tyr Cys Phe Glu Tyr Arg Trp Phe
210 215 220

Asn Lys Gly Ile Glu Met His Gln Arg Leu Ser Asn Ile Glu Arg Asn
225 230 235 240

Trp Pro Tyr Tyr Phe Gly Phe Gly Leu Pro Leu Ala Phe Leu Thr Ala
245 250 255

Met Gln Ser Ser Tyr Ile Ile Ser Gly Cys Leu Phe Ser Ile Leu Phe
260 265 270

Pro Leu Phe Ile Ile Ser Ala Asn Glu Ala Lys Thr Pro Gly Lys Ala
275 280 285

Tyr Leu Phe Gln Leu Arg Leu Phe Ser Leu Val Val Phe Leu Ser Asn
290 295 300

Arg Leu Phe His Lys Thr Val Tyr Leu Gln Ser Ala Leu Ser Ser Ser
305 310 315 320

Thr Ser Ala Glu Lys Phe Pro Ser Pro His Pro Ser Pro Ala Lys Leu
325 330 335

Lys Ala Thr Ala Gly His
340



(2) INFORMATION FOR SEQ ID NO: 638: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638:

Met Ala Lys Phe Met Thr Pro Val Ile Gln Asp Asn Pro Ser Gly Trp
1 5 10 15

Gly Pro Cys Ala Val Pro Glu Gln Phe Arg Asp Met Pro Tyr Gln Pro
20 25 30

Phe Ser Lys Gly Asp Arg Leu Gly Lys Val Ala Asp Trp Thr Gly Ala
35 40 45

Thr Tyr Gln Asp Lys Arg Tyr Thr Asn Lys Tyr Ser Ser Gln Phe Gly
50 55 60



Gly Gly Ser Gln Tyr Ala Tyr Phe His Glu Glu Asp Glu Ser Ser Phe
65 70 75 80

Gln Leu Val Asp Thr Ala Arg Thr Gln Lys Thr Ala Tyr Gln Arg Asn
85 90 95

Arg Met T^g Phe Ala Gln Arg Asn Leu Arg Arg Asp Lys Asp Arg Arg
100 105 110 



Asn Met Leu Gln Phe Asn Leu Gln Ile Leu Pro Lys Ser Ala Lys Gln
115 120 125

Lys Glu Arg Glu Arg Ile Arg Leu Gln Lys Lys Phe Gln Lys Gln Phe
130 135 140

Gly Val Arg Gln Lys Trp Asp Gln Lys Ser Gln Lys Pro Arg Asp Ser
145 150 155 160

Ser Val Glu Val Arg Ser Asp Trp Glu Val Lys Glu Glu Met Asp Phe
165 170 175

Pro Gln Leu Met Lys Met Arg Tyr Leu Glu Val Ser Glu Pro Gln Asp
180 185 190

Ile Glu Cys Cys Gly Ala Leu Glu Tyr Tyr Asp Lys Ala Phe Asp Arg
195 200 205

Ile Thr Thr Arg Ser Glu Lys Pro Leu Arg Xaa Xaa Lys Arg Ile Phe
210 215 220

His Thr Val Thr Thr Thr Asp Asp Pro Val Ile Arg Lys Leu Ala Lys
225 230 235 240

Thr Gln Gly Asn Val Phe Ala Thr Asp Ala Ile Leu Ala Thr Leu Met
245 250 255

Ser Cys Thr Arg Ser Val Tyr Ser Trp Asp Ile Val Val Gln Arg Val
260 265 270

Gly Ser Lys Leu Phe Phe Asp Lys Arg Asp Asn Ser Asp Phe Asp Leu
275 280 285

Leu Thr Val Ser Glu Thr Ala Asn Glu Pro Pro Gln Asp Glu Gly Asn
290 295 300

Ser Phe Asn Ser Pro Arg Asn Leu Ala Met Glu Ala Thr Tyr Ile Asn
305 310 315 320

His Asn Phe Ser Gln Gln Cys Leu Arg Met Gly Lys Glu Arg Tyr Asn
325 330 335

Phe Pro Asn Pro Asn Pro Phe Val Glu Asp Asp Met Asp Lys Asn Glu
340 345 350 



Ile Ala Ser Val Ala Tyr Arg Tyr Arg Ser Gly Lys Leu Gly Asp Asp
355 360 365

Ile Asp Leu Ile Val Arg Cys Glu His Asp Gly Val Met Thr Gly Ala
370 375 380

Asn Gly Glu Val Ser Phe Ile Asn Ile Lys Thr Leu Asn Glu Trp Asp
385 390 395 400

Ser Arg His Cys Asn Gly Val Asp Trp Arg Gln Lys Leu Asp Ser Gln
405 410 415

Arg Gly Ala Val Ile Ala Thr Glu Leu Lys Asn Asn Ser Tyr Lys Leu
420 425 430



Ala Arg Trp Thr Cys Cys Ala Leu Leu Ala Gly Ser Glu Tyr Leu Lys 
435 440 445 

Leu Gly Tyr Val Ser Arg Tyr His Val Lys Asp Ser Ser Arg His Val 
450 455 460 



Ile Leu Gly Thr Gln Gln Phe Lys Pro Asn Glu Phe Ala Ser Gln Ile
465 470 475 480

Asn Leu Ser Val Glu Asn Ala Trp Gly Ile Leu Arg Cys Val Ile Asp
485 490 495

Ile Cys Met Lys Leu Glu Glu Gly Lys Tyr Leu Ile Leu Lys Asp Pro
500 505 510

Asn Lys Gln Val Ile Arg Val Tyr Ser Leu Pro Asp Gly Thr Phe Ser
515 520 525

Ser



(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 

Lys Lys Arg His Thr Asp Val Gln Phe Tyr Thr Glu Val Gly Glu Ile
1 5 10 15

Thr Thr Asp Leu Gly Lys His Gln His Met His Asp Arg Asp Asp Leu
20 25 30

Tyr Ala Glu Gln Met Glu Arg Glu Met Arg His Lys Leu Lys Thr Ala
35 40 45

Phe Lys Asn Phe Ile Glu Lys Val Glu Ala Leu Thr Lys Glu Glu Leu
50 55 60

Glu Phe Glu Val Pro Phe Arg Asp Leu Gly Phe Asn Gly Ala Pro Tyr
65 70 75 80

Arg Ser Thr Cys Leu Leu Gln Pro Thr Ser Ser Ala Leu Val Asn Ala
85 90 95

Thr Glu Trp Pro Pro Phe Val Val Thr Leu Asp Glu Val Glu Leu Ile
100 105 110

His Phe Xaa Arg Val Gln Phe His Leu Lys Asn Phe Asp Met Val Ile
115 120 125

Val Tyr Lys Asp Tyr Ser Lys Lys Val Thr Met Ile Asn Ala Ile Pro
130 135 140

Val Ala Ser Leu Asp Pro Ile Lys Glu Trp Leu Asn Ser Cys Asp Leu
145 150 155 160

Lys Tyr Thr Glu Gly Val Gln Ser Leu Asn Trp Thr Lys Ile Met Lys
165 170 175

Thr Ile Val Asp Asp Pro Glu Gly Phe Phe Glu Gln Gly Gly Trp Ser
180 185 190

Phe Leu 



(2) INFORMATION FOR SEQ ID NO: 640: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 

Arg Ser Gly Leu Gly Leu Gly Ile Thr Ile Ala Phe Leu Ala Thr Leu
1 5 10 15

Ile Thr Gln Phe Leu Val Tyr Asn Gly Val Tyr Gln Tyr Thr Ser Pro
20 25 30

Asp Phe Leu Tyr Ile Arg Ser Trp Leu Pro Cys Ile Phe Phe Ser Gly
35 40 45

Gly Val Thr Val Gly Asn Ile Gly Arg Gln Leu Ala Met Gly Val Pro
50 55 60

Glu Lys Pro His Ser Asp
65 70



(2) INFORMATION FOR SEQ ID NO: 641:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641:

Val Thr Gln Pro Lys His Leu Ser Ala Ser Met Gly Gly Ser Val Glu
1 5 10 15

Ile Pro Phe Ser Phe Tyr Tyr Pro Trp Glu Leu Ala Xaa Xaa Pro Xaa
20 25 30

Val Arg Ile Ser Trp Arg Arg Gly His Phe His Gly Gln Ser Phe Tyr
35 40 45

Ser Thr Arg Pro Pro Ser Ile His Lys Asp Tyr Val Asn Arg Leu Phe
50 55 60

Leu Asn Trp Thr Glu Gly Gln Glu Ser Gly Phe Leu Arg Ile Ser Asn
65 70 75 80

Leu Arg Lys Glu Asp Gln Ser Val Tyr Phe Cys Arg Val Glu Leu Asp
85 90 95

Thr Arg Arg Ser Gly
100



(2) INFORMATION FOR SEQ ID NO: 642: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

Met Glu Ala Gln Gln Val Asn Glu Ala Glu Ser Ala Arg Glu Gln Leu
1 5 10 15

Gln Xaa Leu His Asp Gln Ile Ala Gly Gln Lys Ala Ser Lys Gln Glu
20 25 30

Leu Glu Thr Glu Leu Glu Arg Leu Lys Gln Glu Phe His Tyr Ile Glu
35 40 45

Glu Asp Leu Tyr Arg Thr Lys Asn Thr Leu Gln Ser Arg Ile Lys Asp
50 55 60

Arg Asp Glu Glu Ile Gln Lys Leu Arg Asn Gln Leu Thr Asn Lys Thr
65 70 75 80

Leu Ser Asn Ser Ser Gln Ser Glu Leu Glu Asn Arg Leu His Gln Leu
85 90 95

Thr Glu Thr Leu Ile Gln Lys Gln Thr Met Leu Glu Ser Leu Ser Thr
100 105 110

Glu Lys Asn Ser Leu Val Phe Gln Leu Glu Arg Leu Glu Gln Gln Met
115 120 125

Asn Ser Ala Ser Gly Ser Ser Ser Asn Gly Ser Ser Ile Asn Met Ser
130 135 140

Gly Ile Asp Asn Gly Glu Gly Thr Arg Leu Arg Asn Val Pro Val Leu
145 150 155 160

Phe Asn Asp Thr Glu Thr Asn Leu Ala Gly Met Tyr Gly Lys Val Arg
165 170 175

Lys Ala Ala Ser Ser Ile Asp Gln Phe Ser Ile Arg Leu Gly Ile Phe
180 185 190

Leu Arg Arg Tyr Pro Ile Ala Arg Val Phe Val Ile Ile Tyr Met Ala
195 200 205

Leu Leu His Leu Trp Val Met Ile Val Leu Leu Thr Tyr Thr Pro Glu
210 215 220

Met His His Asp Gln Pro Tyr Gly Lys
225 230 



(2) INFORMATION FOR SEQ ID NO: 643:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 

Ile Arg His Glu Gln His Pro Asn Phe Ser Leu Glu Met His Ser Lys
1 5 10 15

Gly Ser Ser Leu Leu Leu Phe Leu Pro Gln Leu Ile Leu Ile Leu Pro
20 25 30

Val Cys Ala His Leu His Glu Glu Leu Asn Cys
35 40



(2) INFORMATION FOR SEQ ID NO: 644:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644:

Ser Phe Phe Ile Ser Glu Glu Lys Gly His Leu Leu Leu Gln Ala Glu
1 5 10 15

Arg His Pro Trp Val Ala Gly Ala Leu Val Gly Val Ser Gly Gly Leu
20 25 30

Thr Leu Thr Thr Cys Ser Gly Pro Thr Glu Lys Pro Ala Thr Lys Asn
35 40 45

Tyr Phe Leu Lys Arg Leu Leu Gln Glu Met His Ile Arg Ala Asn
50 55 60



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 116, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997



Accession Number 97897 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□

This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 116, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209043 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only

□

This sheet was received by the International Bureau on:

Authorized officer

SDOCID: <WO_9839448A2_IA>

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 119, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit September 4, 1997 



Accession Number 209235 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 122, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97898 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 122, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209044 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 126, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97899 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 126, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209045 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 130, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit April 28, 1997

Accession Number 209011



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 131, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997


Accession Number 97900 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit February 26, 1997

Accession Number 97901



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 131, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209046 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 


Accession Number 209047 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 137, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 22, 1997


Accession Number 209076 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 




INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 140, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □

Name of depositary institution

American Type Culture Collection

Address of depositary institution (including postal code and country)

12301 Parklawn Drive
Rockville, Maryland 20852
United States of America

Date of deposit August 21, 1997

Accession Number 209215


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

This sheet was received with the international application

Authorized officer

Susan White

PCT International Division

For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 160, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97904 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer Susan White 

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 154, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America 



Date of deposit July 3, 1997 



Accession Number 209139



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209049 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only






This sheet was received with the international application 



Authorized officer 



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description
on page 153, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997



Accession Number 97903 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer 






Susan White 




PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 142, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit June 12, 1997 



Accession Number 209119



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer 


Susan White




PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 146, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852
United States of America 



Date of deposit February 26, 1997



Accession Number 97902 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only






This sheet was received with the international application 



Authorized officer Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 146, line N/A



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet □



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 209048 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application



Authorized officer



Susan White

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description 

on page 160, line N/A

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □
Name of depositary institution 

American Type Culture Collection 

Address of depositary institution (including postal code and country)

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 


Accession Number 209050 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only






This sheet was received with the international application 



Authorized officer



Susan White 

PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description 
on page 142, line N/A



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive
Rockville, Maryland 20852 
United States of America 



Date of deposit February 12, 1998 



Accession Number 209627 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession
Number of Deposit")



For receiving Office use only



This sheet was received with the international application 



Authorized officer 


Susan White 




PCT International Division



For International Bureau use only



□ 



This sheet was received by the International Bureau on:



Authorized officer 






WO 98/39448



PCT/US98/04493 






What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a
nucleotide sequence at least 95% identical to a sequence selected from the group
consisting of:

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z,
which is hybridizable to SEQ ID NO:X;

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X,
having biological activity;

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X;

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y;

(i) a polynucleotide capable of hybridizing under stringent conditions to any
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide
sequence of only A residues or of only T residues.

2. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein.

3. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises a nucleotide sequence encoding the sequence
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence included
in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X.
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4. The isolated nucleic acid molecule of claim 1, wherein the
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X. 


5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide
sequence comprises sequential nucleotide deletions from either the C-terminus or the
N-terminus.

7. A recombinant vector comprising the isolated nucleic acid molecule of
claim 1.

8. A method of making a recombinant host cell comprising the isolated
nucleic acid molecule of claim 1.

9. A recombinant host cell produced by the method of claim 8.

10. The recombinant host cell of claim 9 comprising vector sequences.

11. An isolated polypeptide comprising an amino acid sequence at least 95%
identical to a sequence selected from the group consisting of:

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included
in ATCC Deposit No:Z, having biological activity;

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z;

(f) a full length protein of SEQ ID NO:Y or the encoded sequence included in
ATCC Deposit No:Z; 




(g) a variant of SEQ ID NO:Y;

(h) an allelic variant of SEQ ID NO:Y; or

(i) a species homologue of the SEQ ID NO:Y.

12. The isolated polypeptide of claim 11, wherein the secreted form or the
full length protein comprises sequential amino acid deletions from either the C-terminus 
or the N-terminus. 

13. An isolated antibody that binds specifically to the isolated polypeptide of
claim 11.

14. A recombinant host cell that expresses the isolated polypeptide of claim
11.



15. A method of making an isolated polypeptide comprising:
(a) culturing the recombinant host cell of claim 14 under conditions such that
said polypeptide is expressed; and

(b) recovering said polypeptide. 



16. The polypeptide produced by claim 15.

17. A method for preventing, treating, or ameliorating a medical condition,
comprising administering to a mammalian subject a therapeutically effective amount of
the polypeptide of claim 11 or the polynucleotide of claim 1.

18. A method of diagnosing a pathological condition or a susceptibility to a
pathological condition in a subject comprising:

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological
condition based on the presence or absence of said mutation.

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of
claim 11 in a biological sample; and

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 
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20. A method for identifying a binding partner to the polypeptide of claim 11
comprising: 

(a) contacting the polypeptide of claim 11 with a binding partner; and

(b) determining whether the binding partner effects an activity of the 
polypeptide. 

21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y.

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant;

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity.

23. The product produced by the method of claim 22. 
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JP  Japan
NE  Niger
VN  Viet Nam
CG  Congo
KE  Kenya
NL  Netherlands
YU  Yugoslavia
CH  Switzerland
KG  Kyrgyzstan
NO  Norway
ZW  Zimbabwe
CI  Côte d'Ivoire
KP  Democratic People's Republic of Korea
NZ  New Zealand
CM  Cameroon
PL  Poland
CN  China
KR  Republic of Korea
PT  Portugal
CU  Cuba
KZ  Kazakstan
RO  Romania
CZ  Czech Republic
LC  Saint Lucia
RU  Russian Federation
DE  Germany
LI  Liechtenstein
SD  Sudan
DK  Denmark
LK  Sri Lanka
SE  Sweden
EE  Estonia
LR  Liberia
SG  Singapore